react-transcript-editor icon indicating copy to clipboard operation
react-transcript-editor copied to clipboard

Fix: speaker segmentation doesn't lose words

Open jamesdools opened this issue 5 years ago • 0 comments

Describe what the PR does

This addresses the groupWordsInParagraphs function inside the Kaldi-to-draft editor.

Usually, we have speaker segmentation attached from Kaldi, so we haven't been calling this.

But it's throwing away any words that go beyond the last full stop (or exclamation etc.)

Eg. if you input: Live Kaldi Test. Here is some transcript data sorts to just `Live Kaldi Test.'

and if you input `Here is some transcript data, it results in an empty list of paragraphs (which bricks the whole app 🙄😄)

Super quick PR. Added some tests too - let me know if you need more context.

jamesdools avatar Apr 23 '20 13:04 jamesdools