classifai Integrate Watson API for speech-to-text

Integrate Watson API for speech-to-text

Open jeffpaul opened this issue 5 years ago • 3 comments

Splitting this out from #2

Apr 08 '19 20:04 jeffpaul

Additional info:

The IBM Watson Speech to Text provides 3 interfaces for speech recognition:

The synchronous HTTP interface
The asynchronous HTTP interface
The WebSocket interface (recommended)

See more on Advantages of the WebSocket interface.

@jeffpaul This service provides a lot of features, few of them such as: Speaker labels, Profanity filtering and Background audio suppression, etc.

Can you expand on the use case of this feature? That way we can list out the features that can go with the implementation.

Jul 17 '22 12:07 Sidsector9

@Sidsector9 I believe the original thought on this enhancements was taking live speech and generating text from that for captions, so the speaker labels bit you highlighted probably most applies here.

Jul 27 '22 02:07 jeffpaul

Could similarly look at using OpenAI's Whisper for this.

Sep 26 '22 21:09 jeffpaul

classifai classifai copied to clipboard

Integrate Watson API for speech-to-text

Additional info:

classifai
classifai copied to clipboard