swift-sdk icon indicating copy to clipboard operation
swift-sdk copied to clipboard

Support Streaming Audio Data via recognizeUsingWebSocket

Open jeff-arn opened this issue 5 years ago • 17 comments

When you open an issue for a feature request, please add as much detail as possible:

Currently, the interface exposed in SpeechToTextV1/SpeechToText+Recognize.swift only leaves a SpeechToTextSession alive for the time that it takes to transcribe a Data blob.

We should add support to send smaller chunks of data in realtime as a part of one session, to support streaming audio applications that are not driven via the microphone.

jeff-arn avatar Nov 18 '19 19:11 jeff-arn

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Jan 19 '20 00:01 stale[bot]

@repjarms is this feature still on the roadmap?

digitallysavvy avatar Jan 20 '20 07:01 digitallysavvy

@digitallysavvy Yes the plan now is to get it out in a feature level release next week. While I am working on it, can you please provide some more details about the type of interface you are looking for.

For example,

Are you expecting to send an array of bytes, or configure AVAudioSession in a particular way? Any information about how this feature will be used will help me make sure I am delivering something that will address your needs.

jeff-arn avatar Jan 20 '20 18:01 jeff-arn

@repjarms Great news!

digitallysavvy avatar Jan 20 '20 22:01 digitallysavvy

@repjarms We're expecting to send an array of bytes. Agora's SDK passes a raw audio buffer, which I'm converting into a Data object. The existing API processes what I have correctly, minus the ability to stream continuously.

zontan avatar Jan 20 '20 22:01 zontan

@repjarms when @zontan was testing, he was able to send the Data in bursts but it was not a continuous stream (socket) in the way the WatsonMic has its implementation.

digitallysavvy avatar Jan 20 '20 22:01 digitallysavvy

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Mar 20 '20 23:03 stale[bot]

@repjarms I realize there probably isn't an update on this but leaving a comment as to keep the issue from closing.

digitallysavvy avatar Mar 21 '20 02:03 digitallysavvy

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar May 20 '20 05:05 stale[bot]

Still active

mediumTaj avatar May 20 '20 10:05 mediumTaj

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Jul 19 '20 12:07 stale[bot]

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Sep 19 '20 18:09 stale[bot]

@mediumTaj can we keep this active?

digitallysavvy avatar Sep 21 '20 14:09 digitallysavvy

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Nov 25 '20 00:11 stale[bot]

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Jun 18 '21 23:06 stale[bot]

This issue has been automatically marked as stale because it has had no recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Apr 16 '22 23:04 stale[bot]

is there still a chance to get access to the mic buffer?

digitallysavvy avatar Apr 20 '22 16:04 digitallysavvy