Add Streaming speech-to-text support for aws-android-sdk-transcribe

Open AptFox opened this issue 5 years ago • 17 comments

Which AWS Services is the feature request for?
aws-android-sdk-transcribe - AWS Transcribe Medical Streaming

Is your feature request related to a problem? Please describe.
No, I'd like to use the Android SDK to convert speech to text on the fly. It seems the SDK only supports a flow involving S3 buckets.

Describe the solution you'd like
I'd like to use the Android SDK to convert speech to text on the fly.

Describe alternatives you've considered
I'm currently attempting to access the streaming API directly using a WebSocket library. However, generating the Signature Version 4 signature is a nightmare.

Additional context
N/A

AptFox avatar Jan 15 '20 21:01 AptFox
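
For readers wondering what the Signature Version 4 step mentioned above involves, here is a minimal sketch of just the signing-key derivation and the final HMAC, using only the Java standard library. It is not taken from this thread or from any AWS SDK; the class and method names are hypothetical, the credentials are placeholders, and the service name "transcribe" and the region are assumptions to check against the AWS documentation. Building the canonical request, the string to sign, and the presigned WebSocket URL is not shown.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;

// Hypothetical helper: illustrates the SigV4 key-derivation chain only.
public class SigV4KeySketch {

    // HMAC-SHA256 of a UTF-8 string under the given key.
    public static byte[] hmacSha256(byte[] key, String data) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(new SecretKeySpec(key, "HmacSHA256"));
            return mac.doFinal(data.getBytes(StandardCharsets.UTF_8));
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("HmacSHA256 unavailable", e);
        }
    }

    // Derives the signing key: date -> region -> service -> "aws4_request".
    // dateStamp is the UTC date in yyyyMMdd form.
    public static byte[] signingKey(String secretKey, String dateStamp,
                                    String region, String service) {
        byte[] kSecret = ("AWS4" + secretKey).getBytes(StandardCharsets.UTF_8);
        byte[] kDate = hmacSha256(kSecret, dateStamp);
        byte[] kRegion = hmacSha256(kDate, region);
        byte[] kService = hmacSha256(kRegion, service);
        return hmacSha256(kService, "aws4_request");
    }

    // Lowercase hex encoding, as SigV4 expects for the final signature.
    public static String toHex(byte[] bytes) {
        StringBuilder sb = new StringBuilder(bytes.length * 2);
        for (byte b : bytes) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    // Signs an already-built string to sign with the derived key.
    public static String sign(byte[] signingKey, String stringToSign) {
        return toHex(hmacSha256(signingKey, stringToSign));
    }

    public static void main(String[] args) {
        // Placeholder secret; "transcribe" and "us-east-1" are assumptions.
        byte[] key = signingKey("EXAMPLE-SECRET-KEY", "20200115",
                                "us-east-1", "transcribe");
        System.out.println("signing key (hex): " + toHex(key));
    }
}
```

The derived key is scoped to one date, region, and service, so it has to be recomputed when any of those change. Most of the difficulty people report with this process tends to lie in the canonical request that precedes this step, which this sketch deliberately leaves out.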

+1

tao1 avatar Feb 07 '20 10:02 tao1

+1

jeromelesaux avatar Feb 19 '20 16:02 jeromelesaux

+1, hope it will be integrated very soon :)

MacPower avatar Feb 20 '20 16:02 MacPower

Related? https://github.com/aws-amplify/aws-sdk-android/pull/1759

ryanfurness avatar May 26 '20 15:05 ryanfurness

I just looked through #1759 and it doesn't seem related to this request. Nothing in it enables streaming transcription.

AptFox avatar May 26 '20 15:05 AptFox

Hi @AptFox, I am also working on speech-to-text using AWS Transcribe and I need your help. Can you help me, please?

Suraj820 avatar Jun 07 '20 11:06 Suraj820

@Suraj820 I cannot.

AptFox avatar Jun 07 '20 12:06 AptFox

+1

hzbcdut avatar Jul 01 '20 09:07 hzbcdut

+1

VaqibChauhan avatar Feb 16 '21 20:02 VaqibChauhan

We are just getting started down this path, and finding any docs on how to use this with Android is nearly impossible. We also want near-real-time transcription, which means WebSockets. The speech to transcribe will be very short, around 8 to 12 words, where the user asks for various parts of a complex filter dialog to be filled in based on what they say. I have this working using the android.speech.SpeechRecognizer library.

There will also be an iOS version. We could use Google- and iOS-specific implementations, but then each side would need to maintain the ML aspects of things too. Doing it through Amazon would let us keep the ML in one place, where it can be updated outside the apps, and would offer nearly identical results on both platforms.

The issue is the development effort to get WebSockets working on both sides. We would rather see this hidden behind an Amazon library.

MKevin3 avatar Aug 09 '21 15:08 MKevin3

See if this helps you. It came out of last year, when this still wasn't solved and we partnered with some of our AWS team: https://github.com/northwesternmutual/react-native-transcribe

ryanfurness avatar Aug 09 '21 16:08 ryanfurness

Any updates on this? Also, could you please update your docs? The Android docs are a nightmare, to be honest. I've been browsing your official site for hours, but I can't find anything on how to use the Android Transcribe SDK.

taouichaimaa avatar Jan 28 '22 08:01 taouichaimaa

Since the Java library can do that (implementation "software.amazon.awssdk:transcribestreaming:2.20.0"), do you recommend using it instead of the synchronous approach, like the temporary solution in https://github.com/aws-amplify/aws-sdk-android/issues/2781 ?

I have already implemented Google Cloud speech recognition, but we are facing the same issue as @MKevin3: we want to use the same model on multiple platforms, and we want to expand and add new languages.

rogerr99s avatar Feb 10 '23 13:02 rogerr99s

Any news about that? I'm getting this error when implementing the Java one:

type=1400 audit(0.0:58580): avc: denied { read } for name="somaxconn" dev="proc" ino=16755461 scontext=u:r:untrusted_app:s0:c58,c257,c512,c768 tcontext=u:object_r:proc_net:s0 tclass=file permissive=0

rogerr99s avatar Mar 06 '23 18:03 rogerr99s

Hi team, any update on this? Same issue for me. Thanks.

ashirkhan94 avatar Jun 13 '23 12:06 ashirkhan94

Hi, here is the Transcribe streaming code I wrote for Android; you can try it. 🚀 https://github.com/sinyu1012/aws-transcribe-streaming-example-android

sinyu1012 avatar Jun 30 '23 12:06 sinyu1012