whisper.cpp
Get text divided into paragraphs?
This would be useful when transcribing to a text document because having the text divided into paragraphs makes it more readable. This may be outside the scope of this project. Just thought I would ask.
Yeah, this sounds out of scope. There is probably some third-party tool that you can apply to the output of whisper.cpp.
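As a rough illustration of what such post-processing could look like, here is a minimal sketch that starts a new paragraph whenever there is a long pause between timed segments. Everything in it is an assumption made for the example: the `{ start, end, text }` segment shape (times in seconds), the `groupIntoParagraphs` helper, and the 1.5 second threshold are hypothetical and are not part of whisper.cpp.

```javascript
// Hypothetical post-processing sketch: group timed segments into paragraphs.
// The segment shape ({ start, end, text }, times in seconds) and the 1.5 s
// pause threshold are assumptions, not whisper.cpp's output format or API.
function groupIntoParagraphs(segments, pauseSeconds = 1.5) {
  const paragraphs = [];
  let current = [];
  let previousEnd = null;

  for (const { start, end, text } of segments) {
    // A long silence before this segment starts a new paragraph.
    const longPause = previousEnd !== null && start - previousEnd >= pauseSeconds;
    if (longPause && current.length > 0) {
      paragraphs.push(current.join(" "));
      current = [];
    }
    current.push(text.trim());
    previousEnd = end;
  }

  // Flush whatever is left as the final paragraph.
  if (current.length > 0) paragraphs.push(current.join(" "));
  return paragraphs;
}

// Made-up sample data for illustration only.
const demoSegments = [
  { start: 0.0, end: 2.0, text: "First topic starts here." },
  { start: 2.1, end: 4.0, text: "It continues." },
  { start: 6.5, end: 8.0, text: "Second topic after a pause." },
];
console.log(groupIntoParagraphs(demoSegments).join("\n\n"));
```

Pauses are only a crude proxy for topic changes, so the threshold would likely need tuning per recording.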
@sindresorhus you can try the built-in JavaScript segmenter (`Intl.Segmenter`): https://www.stefanjudis.com/today-i-learned/how-to-split-javascript-strings-with-intl-segmenter/
@abodacs The linked segmenter only covers sentences, not paragraphs. I'm already doing sentence segmentation in my app.
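For context, sentence segmentation with the built-in `Intl.Segmenter` looks roughly like the sketch below. The `splitSentences` helper and the sample text are made up for illustration. The supported `granularity` values are `"grapheme"`, `"word"`, and `"sentence"`; there is no paragraph option, which is the limitation mentioned above.

```javascript
// Sketch: split text into sentences with the built-in Intl.Segmenter.
// splitSentences and the sample text are hypothetical, for illustration only.
function splitSentences(text, locale = "en") {
  const segmenter = new Intl.Segmenter(locale, { granularity: "sentence" });
  // Each yielded item has a `segment` string; trim trailing whitespace
  // and drop any empty pieces.
  return Array.from(segmenter.segment(text), (item) => item.segment.trim())
    .filter((sentence) => sentence.length > 0);
}

const sampleText = "Hello there. How are you today? I am fine.";
const sentences = splitSentences(sampleText);
console.log(sentences);
```

The result is a flat list of sentences, so deciding where one paragraph ends and the next begins still has to be done separately.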
@sindresorhus Aha, you can check this discussion https://github.com/openai/whisper/discussions/552