dify icon indicating copy to clipboard operation
dify copied to clipboard

can't connect an audio file in speech to text tool

Open lucaseatp opened this issue 1 year ago • 1 comments

Self Checks

  • [X] This is only for bug report, if you would like to ask a question, please head to Discussions.
  • [X] I have searched for existing issues search for existing issues, including closed ones.
  • [X] I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
  • [X] [FOR CHINESE USERS] 请务必使用英文提交 Issue,否则会被关闭。谢谢!:)
  • [X] Please do not modify this template :) and fill in all the required fields.

Dify version

0.11.2

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

  1. connect the tool "speech to text" in your workflow
  2. try to create a variable for audio or video
  3. try to connect the custom variable or sys.files to speech to text tool
  4. tool doesn't show any variable available, can't move forward transcribing the audio

✔️ Expected Behavior

choose an audio/video variable to convert with whisper or other model

❌ Actual Behavior

can't choose any variable

lucaseatp avatar Nov 25 '24 01:11 lucaseatp

I found a similar unsolved discussion that might be relevant to your issue: Error in item: Invalid variable, which was last updated on November 21, 2024 [1].

Regarding your issue with connecting an audio file in the 'speech to text' tool, ensure that the audio file format is supported. The supported formats are flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, and webm [2]. If your file is in one of these formats and the issue persists, it might be related to how variables are being recognized or connected in the workflow. Unfortunately, I couldn't find specific details on variable connection issues in the current version. You might want to check if there are any updates or patches that address this problem.

To continue talking to Dosu, mention @dosu.

dosubot[bot] avatar Nov 25 '24 01:11 dosubot[bot]

image I find a bug for this issue: the hard code temp.mp3, use the code below? audio binary.name = file.get("filename") if file.get("filename") else "temp.mp3"

lanch avatar Nov 28 '24 09:11 lanch

Hi, @lucaseatp. I'm Dosu, and I'm helping the Dify team manage their backlog. I'm marking this issue as stale.

Issue Summary:

  • You reported a bug in the speech-to-text tool, unable to connect audio/video files due to missing variables.
  • I suggested checking file format compatibility and referenced a similar discussion.
  • Lanch identified a potential bug with hardcoded filenames and suggested a code modification.
  • The issue remains unresolved with no updates or patches provided.

Next Steps:

  • Please confirm if this issue is still relevant to the latest version of the Dify repository by commenting here to keep the discussion open.
  • If there is no further activity, this issue will be automatically closed in 15 days.

Thank you for your understanding and contribution!

dosubot[bot] avatar Dec 29 '24 16:12 dosubot[bot]