ExplainaBoard icon indicating copy to clipboard operation
ExplainaBoard copied to clipboard

New task: speech recognition

Open neubig opened this issue 3 years ago • 2 comments

Speech recognition is a standard generation task where the input is speech, output is text. For now, analysis could be done on the output side only.

  • Evaluation metric: word error rate, character error rate
  • Types of analysis: could be done on the level of individual deletions, substitutions, insertions. I have an example here: https://github.com/neubig/util-scripts/blob/master/counterrors.pl https://github.com/neubig/util-scripts/blob/master/error-diff.pl

neubig avatar Jan 27 '22 16:01 neubig

Sounds interesting: @neubig

  • has some datasets of this task been covered by huggingface dataset?
  • do we have en example of the system output file?

pfliu-nlp avatar Jan 28 '22 05:01 pfliu-nlp

  • Yes, here's an example: https://huggingface.co/datasets/librispeech_asr
  • I'll work on getting system outputs

neubig avatar Jan 29 '22 14:01 neubig