mpv Is it possible to OCR playing video using OCR backend like Tesseract

Is it possible to OCR playing video using OCR backend like Tesseract

Open gunir opened this issue 10 months ago • 2 comments

I'm trying to make mpv OCR a video to detect subtitles and translate to another window, mostly for watching movies that I don't understand and language learning, but the issue is I can't find API to capture screen to memory, or even more important, capture region to memory (to make OCR more accurate and cost less resource, because smaller = faster ? Is there APIs like that ?

Apr 12 '24 06:04 gunir

There are already projects that allow you to do this.

Apr 12 '24 15:04 fideliochan

It'd be better to translate subtitles beforehand via whisper. https://github.com/abb128/LiveCaptions Also exists.

Apr 18 '24 09:04 cold-nagai

mpv mpv copied to clipboard

Is it possible to OCR playing video using OCR backend like Tesseract

mpv
mpv copied to clipboard