
tts output bug report

Open simplify123 opened this issue 8 months ago • 3 comments


Self Checks

  • [x] This is only for bug report, if you would like to ask a question, please head to Discussions.
  • [x] I have searched for existing issues, including closed ones.
  • [x] I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
  • [x] [FOR CHINESE USERS] Please be sure to submit issues in English, otherwise they will be closed. Thank you! :)
  • [x] Please do not modify this template :) and fill in all the required fields.

Dify version

1.1.3

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

I used the hjlarry/audio plugin to convert the responses generated by the LLM into audio output. After running the workflow, I got the following error:

Run failed: plugin daemon: {"message":"Key: 'InvokePluginRequest[github.com/langgenius/dify-plugin-daemon/pkg/entities/requests.RequestInvokeTTS].Data.InvokeTTSSchema.Voice' Error:Field validation for 'Voice' failed on the 'required' tag","error_type":"PluginDaemonBadRequestError","args":null}, code: -400

The deployed text-to-speech model is CosyVoice2-0.5B (langgenius/xinference/xinference), served by Xinference running in Docker. In the Dify interface, under the Text To Speech node, the "select a voice" dropdown is not clickable and no voices are listed.

How can I resolve this issue?
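
For anyone triaging the same failure, the error text above can be broken down to show which request field was rejected. The following is a minimal sketch, not part of Dify or the plugin daemon: the helper name `parse_daemon_error` is hypothetical, and the regular expression assumes the message keeps the "Field validation for '...' failed on the '...' tag" wording seen in this report.

```python
import json
import re

# Error text copied verbatim from this report.
RAW_ERROR = (
    'Run failed: plugin daemon: {"message":"Key: '
    "'InvokePluginRequest[github.com/langgenius/dify-plugin-daemon/pkg/entities/"
    "requests.RequestInvokeTTS].Data.InvokeTTSSchema.Voice' "
    "Error:Field validation for 'Voice' failed on the 'required' tag\","
    '"error_type":"PluginDaemonBadRequestError","args":null}, code: -400'
)


def parse_daemon_error(raw: str) -> dict:
    """Hypothetical helper: pull the JSON payload out of the error line
    and extract the field and rule named in the validation message."""
    start = raw.index("{")
    # raw_decode parses one JSON value and ignores the trailing ", code: -400".
    payload, _ = json.JSONDecoder().raw_decode(raw, start)
    match = re.search(
        r"Field validation for '(\w+)' failed on the '(\w+)' tag",
        payload["message"],
    )
    return {
        "error_type": payload["error_type"],
        "field": match.group(1) if match else None,
        "rule": match.group(2) if match else None,
    }


if __name__ == "__main__":
    print(parse_daemon_error(RAW_ERROR))
```

Read this way, the daemon rejected the request because the `Voice` field failed its `required` check, which is consistent with the voice dropdown in the Text To Speech node being empty.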

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

simplify123 avatar Apr 04 '25 16:04 simplify123

Hello @simplify123! I'm Dosu, your friendly bot here to help you with bugs, questions, and becoming a contributor while you wait for a human maintainer. Let's tackle that TTS issue together! 🚀

There is a similar closed issue that might help you resolve the problem. The solution suggested was to ensure that the voice parameter is correctly handled and passed through the relevant functions and API calls. You may need to modify the code to include the voice parameter where necessary, such as updating the XinferenceModelExtraParameter class and related methods to include and handle the voice parameter. Additionally, ensure that your Dify Docker deployment is correctly configured to access the Xinference endpoint by using the machine's IP address instead of localhost [1].
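
The localhost point can be checked with a small script. Inside a Docker container, `localhost` resolves to the container itself, not to the host where Xinference is listening. This is only a sketch: the function name `is_loopback_url` and the example URLs (including the port) are illustrative assumptions, not values taken from this issue.

```python
from urllib.parse import urlparse

# Hostnames that point back at the container itself when used inside Docker.
LOOPBACK_HOSTS = {"localhost", "127.0.0.1", "::1"}


def is_loopback_url(url: str) -> bool:
    """Return True if the URL's host is a loopback name or address."""
    return urlparse(url).hostname in LOOPBACK_HOSTS


if __name__ == "__main__":
    # Example endpoints only; substitute the server URL configured in Dify.
    for url in ("http://localhost:9997", "http://192.168.1.10:9997"):
        status = "loopback, unreachable from another container" if is_loopback_url(url) else "ok"
        print(f"{url}: {status}")
```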


dosubot[bot] avatar Apr 04 '25 16:04 dosubot[bot]

same issue

haiker2011 avatar Apr 08 '25 09:04 haiker2011

I'm on version 1.2.0 and have the same issue.

fanhuafeng avatar Apr 11 '25 02:04 fanhuafeng

same issue

91geek avatar Apr 15 '25 14:04 91geek

Same issue. Has this been resolved?

Fucheng-Wu avatar Jun 13 '25 03:06 Fucheng-Wu