dify icon indicating copy to clipboard operation
dify copied to clipboard

Implement OpenAI's Realtime API

Open kuldeepdaftary opened this issue 1 year ago • 29 comments

Self Checks

  • [X] I have searched for existing issues search for existing issues, including closed ones.
  • [X] I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
  • [X] [FOR CHINESE USERS] 请务必使用英文提交 Issue,否则会被关闭。谢谢!:)
  • [X] Please do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing? Tell me about your story.

OpenAI recently released a new API for real-time voice conversations. As DIFY is a powerful RAG (Retrieval-Augmented Generation) building application, integrating this new API could significantly enhance its capabilities and user experience.

Implement support for OpenAI's Real-Time Voice Conversation API within DIFY, allowing users to:

  1. Conduct real-time voice conversations with AI models
  2. Integrate voice input/output with DIFY's existing RAG capabilities
  3. Potentially create voice-based chatbots or assistants

2. Additional context or comments

https://platform.openai.com/docs/guides/realtime Here is the API docs for our realTime API.

https://youtu.be/vXu2MZ7fp-I?si=mjFmAa4B7O06nDTy&t=231 here is how it's been used

3. Can you help us with this feature?

  • [ ] I am interested in contributing to this feature.

kuldeepdaftary avatar Oct 02 '24 13:10 kuldeepdaftary

that would be awesome ! looking forward to it !

remisharrock avatar Oct 03 '24 16:10 remisharrock

+1

taowang1993 avatar Oct 04 '24 01:10 taowang1993

+1

Mrhuang09 avatar Oct 08 '24 04:10 Mrhuang09

+1

Kevin9703 avatar Oct 08 '24 08:10 Kevin9703

+1

LeonKalt avatar Oct 08 '24 08:10 LeonKalt

+1

likenamehaojie avatar Oct 10 '24 03:10 likenamehaojie

Does anyone have any good ideas? Let's discuss how we can combine our efforts with the knowledge base. I am particularly interested in this aspect."

likenamehaojie avatar Oct 10 '24 13:10 likenamehaojie

+1 Very much expect!

jakcm avatar Oct 11 '24 11:10 jakcm

+1

maninder-ia avatar Oct 14 '24 02:10 maninder-ia

+1

ganeshbehera avatar Oct 17 '24 04:10 ganeshbehera

+1

lhyphendixon avatar Oct 17 '24 23:10 lhyphendixon

+1

aitorroma avatar Oct 18 '24 20:10 aitorroma

+1

Zane-Qbb avatar Oct 21 '24 09:10 Zane-Qbb

+1, I think implementing this feature fast will put Dify ahead of other RAG platforms

youssefsiam38 avatar Oct 22 '24 02:10 youssefsiam38

+1

wwwDESIGN-basti avatar Oct 22 '24 15:10 wwwDESIGN-basti

Hi @kuldeepdaftary , How's about this implementation? I really like to use this feature on my chatbot.

lhlong avatar Nov 01 '24 01:11 lhlong

Hi, @kuldeepdaftary. I'm Dosu, and I'm helping the Dify team manage their backlog. I'm marking this issue as stale.

Issue Summary

  • Proposal to integrate OpenAI's Real-Time Voice Conversation API into Dify for voice-based interactions.
  • Significant community support and enthusiasm for the feature.
  • @likenamehaojie suggested combining efforts with the knowledge base.
  • @lhlong inquired about the progress, indicating strong interest.

Next Steps

  • Please let us know if this issue is still relevant to the latest version of Dify by commenting here.
  • If there is no further activity, the issue will be automatically closed in 15 days.

Thank you for your understanding and contribution!

dosubot[bot] avatar Dec 02 '24 16:12 dosubot[bot]

push. its relevant.

firstcomeuropeag avatar Dec 02 '24 18:12 firstcomeuropeag

@takatost, the user @firstcomeuropeag has indicated that the proposal to integrate OpenAI's Real-Time Voice Conversation API into Dify is still relevant. Could you please assist them with this issue?

dosubot[bot] avatar Dec 02 '24 18:12 dosubot[bot]

I think so

ChnMig avatar Dec 19 '24 06:12 ChnMig

+1

yuhaowin avatar Jan 20 '25 11:01 yuhaowin

+1

ymshenyu avatar Jan 23 '25 10:01 ymshenyu

+1

xiaohanghu avatar Feb 07 '25 02:02 xiaohanghu

+1

edprodpo avatar Feb 17 '25 08:02 edprodpo

+1

Aaryan-Kapoor avatar Mar 03 '25 08:03 Aaryan-Kapoor

+1 any updates on this

abhijithem avatar Mar 05 '25 07:03 abhijithem

+1 期待此功能,下一代agent绝对需要全模态!

BruceLee569 avatar Mar 08 '25 12:03 BruceLee569

Hi, @kuldeepdaftary. I'm Dosu, and I'm helping the Dify team manage their backlog. I'm marking this issue as stale.

Issue Summary:

  • Proposal to integrate OpenAI's Real-Time Voice Conversation API for voice-based interactions.
  • Significant community support and interest in the feature.
  • Suggestions for collaboration with the knowledge base and inquiries about progress.
  • Recent affirmation of the issue's relevance by community members, prompting further engagement.

Next Steps:

  • Please confirm if this issue is still relevant to the latest version of the Dify repository by commenting here.
  • If no updates are provided, the issue will be automatically closed in 15 days.

Thank you for your understanding and contribution!

dosubot[bot] avatar Apr 08 '25 16:04 dosubot[bot]

yes still relevant

remisharrock avatar Apr 11 '25 00:04 remisharrock

+1

badereddineqodia avatar Apr 21 '25 10:04 badereddineqodia