annyang Add support for capturing interimResults (Real Time Hypothosis)

Add support for capturing interimResults (Real Time Hypothosis)

Open alanjames1987 opened this issue 9 years ago • 13 comments

It would be great to have an event listener which allows me to get hypothosis in real time, allowing the user to see what they are saying in real time, similar to how Android's speech input works.

I know the webkitSpeechRecognition allows this and I would be willing to add it once I have time.

Feb 17 '15 23:02 alanjames1987

If you're asking why I would want to have real time hypotheses it's because it would allow...

the user to see what they are saying in real time, similar to how Android's speech input works.

Basically to indicate that speech recognition is working.

If you're asking why I would want to add this feature once I have time then it's because I like this library and I want to see it have more features that other developers and I would use.

Feb 19 '15 03:02 alanjames1987

That reply has nothing to do with this thread.

@TalAter I don't know if the above comment is from a bot or a person trolling. In either case I will stop responding to it and defer to you to moderate.

Feb 19 '15 20:02 alanjames1987

@alanjames1987 This is now possible in annyang v2.0.0

Since v2.0.0, the result, resultNoMatch, resultMatch callbacks are now called with a list of phrases the user said.

So you can do something like:

annyang.addCallback('result', function(phrases) {
  console.log('Speech recognized. Possible sentences said:');
  console.log(phrases);
});

Aug 12 '15 10:08 TalAter

Does the 2.0.0 release allow for continuous / real time results?

I don't see a way to enable that.

Aug 12 '15 15:08 alanjames1987

Sorry, I was meaning interimResults. I don't see a way to enable that and that was what I was referring to when I called it "real time".

Aug 12 '15 15:08 alanjames1987

No.

I think I didn't add it originally because I didn't want to try and match interim results with commands.

But with the new callbacks that can be called with parameters, we could maybe implement interim results as a callback.

We'll need to check if we can enable it without affecting current functionality...

What do you think about this approach?

Aug 12 '15 15:08 TalAter

I think that would be a great approach.

Aug 12 '15 16:08 alanjames1987

Hi, I wanted to use your awesome library but I need interimResults on my project :/ Is it possible to implement that ? No news on this issue since 2015 :( Currently, it's very difficult for somebody to know if the application listen him without interimResult. I only need a simple render like this one : https://www.google.com/intl/fr/chrome/demos/speech.html (grey interim results, black final results) :)

EDIT: I find a workaround but I think that implement interim results as a callback is a better solution.

  var recognition = annyang.getSpeechRecognizer();
  var final_transcript = '';
  recognition.interimResults = true;
  annyang.start();

      recognition.onresult = function(event) {
        var interim_transcript = '';
        final_transcript = '';
        for (var i = event.resultIndex; i < event.results.length; ++i) {
            if (event.results[i].isFinal) {
                final_transcript += event.results[i][0].transcript;
                console.log("final_transcript");
                console.log(final_transcript);
                annyang.trigger(final_transcript); //If the sentence is "final" for the Web Speech API, we can try to trigger the sentence
            } else {
                interim_transcript += event.results[i][0].transcript;
                console.log("interim_transcript");
                console.log(interim_transcript);
            }
        }
        final_transcript = capitalize(final_transcript);
        final_span.innerHTML = linebreak(final_transcript);
        interim_span.innerHTML = linebreak(interim_transcript);
      };

May 03 '16 10:05 tar-gezed

Thanks @Kant73 ! I used that code to implement interim results as well and it's working great.

Sep 02 '16 13:09 busbyk

The @Kant73 solution for this issue worked perfectly for me.

Oct 09 '16 23:10 ojvribeiro

I'm trying to get interim results. I see two people here that say they used the above method and it worked. I can't seem to get it to work. Has anything changed since this was posted. Here is my exact code. I had to change it slightly from Kant73's original just because I think he only posted a snippet.

<!DOCTYPE html>
<html class="no-js consumer" lang="en">
  <head>
  <script src='//cdnjs.cloudflare.com/ajax/libs/annyang/2.6.0/annyang.min.js'></script>
<script>
  annyang.start();
  var recognition = annyang.getSpeechRecognizer();
  var final_transcript = '';
  recognition.interimResults = true;
    recognition.onresult = function(event) {
    var interim_transcript = '';
    final_transcript = '';
    for (var i = event.resultIndex; i < event.results.length; ++i) {
        if (event.results[i].isFinal) {
            final_transcript += event.results[i][0].transcript;
            console.log("final_transcript="+final_transcript);
            annyang.trigger(final_transcript); //If the sentence is "final" for the Web Speech API, we can try to trigger the sentence
        } else {
            interim_transcript += event.results[i][0].transcript;
            console.log("interim_transcript="+interim_transcript);
        }
    }

    document.getElementById('123').innerHTML =  'interim='+interim_transcript+'<br/>final='+final_transcript;
	console.log('interim='+interim_transcript+'|final='+final_transcript);
  };
</script>
</head>
<body class="" id="grid">
<br/><br/>
 Annyang! Speech Test<br/><br/>
<div id='123'>
No results yet
 </div>
</body>
 </html>`

Thanks.

Jan 11 '17 04:01 Thread7

Argh. Sorry I figured it out 2 minutes after posting this. The interim went so fast that it only showed up in the console but never in the <div> on the screen. I figured I'd leave my above comment in case someone wants a full example that works (at least in the console).

Jan 11 '17 04:01 Thread7

Thanks!!!! It saves me a lot :)

Jan 24 '17 04:01 gicontz

annyang annyang copied to clipboard

Add support for capturing interimResults (Real Time Hypothosis)

annyang
annyang copied to clipboard