dify tts output bug report

Self Checks

[x] This is only for bug report, if you would like to ask a question, please head to Discussions.
[x] I have searched for existing issues search for existing issues, including closed ones.
[x] I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
[x] [FOR CHINESE USERS] 请务必使用英文提交 Issue，否则会被关闭。谢谢！:)
[x] Please do not modify this template :) and fill in all the required fields.

Dify version

1.1.3

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

I have used the hjlarry/audio plugin to convert the responses generated by the LLM into audio output. However, after running the workflow, I encountered the following error: Run failed: plugin daemon: {"message":"Key: 'InvokePluginRequest[github.com/langgenius/dify-plugin-daemon/pkg/entities/requests.RequestInvokeTTS].Data.InvokeTTSSchema.Voice' Error:Field validation for 'Voice' failed on the 'required' tag","error_type":"PluginDaemonBadRequestError","args":null}, code: -400. The text-to-speech model deployed is Voice of CosyVoice2-0.5B (langgenius/xinference/xinference). In the Dify interface, under the Text To Speech node, the select a voice dropdown menu is not clickable and there is no list of voices available. CosyVoice2-0.5B is running via Docker deployment of Xinference. How can I resolve this issue?

Run failed: plugin daemon: {"message":"Key: 'InvokePluginRequest[github.com/langgenius/dify-plugin-daemon/pkg/entities/requests.RequestInvokeTTS].Data.InvokeTTSSchema.Voice' Error:Field validation for 'Voice' failed on the 'required' tag","error_type":"PluginDaemonBadRequestError","args":null}, code: -400

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

Apr 04 '25 16:04 simplify123

Hello @simplify123! I'm Dosu, your friendly bot here to help you with bugs, questions, and becoming a contributor while you wait for a human maintainer. Let's tackle that TTS issue together! 🚀

There is a similar closed issue that might help you resolve the problem. The solution suggested was to ensure that the voice parameter is correctly handled and passed through the relevant functions and API calls. You may need to modify the code to include the voice parameter where necessary, such as updating the XinferenceModelExtraParameter class and related methods to include and handle the voice parameter. Additionally, ensure that your Dify Docker deployment is correctly configured to access the Xinference endpoint by using the machine's IP address instead of localhost ^[1].

To continue talking to Dosu, mention @dosu.

^{Help Dosu learn! Give it feedback: Great Response | Irrelevant Answer | Incorrect Sources | Too Verbose | Hallucination | Bug Report | Other}

Apr 04 '25 16:04 dosubot[bot]

same issue

Apr 08 '25 09:04 haiker2011

I used 1.2.0 version，same issue

Apr 11 '25 02:04 fanhuafeng

same issue

Apr 15 '25 14:04 91geek

same issue,请问解决了吗

Jun 13 '25 03:06 Fucheng-Wu