mpv icon indicating copy to clipboard operation
mpv copied to clipboard

Is it possible to OCR playing video using OCR backend like Tesseract

Open gunir opened this issue 10 months ago • 2 comments

I'm trying to make mpv OCR a video to detect subtitles and translate to another window, mostly for watching movies that I don't understand and language learning, but the issue is I can't find API to capture screen to memory, or even more important, capture region to memory (to make OCR more accurate and cost less resource, because smaller = faster ? Is there APIs like that ?

gunir avatar Apr 12 '24 06:04 gunir

There are already projects that allow you to do this.

fideliochan avatar Apr 12 '24 15:04 fideliochan

It'd be better to translate subtitles beforehand via whisper. https://github.com/abb128/LiveCaptions Also exists.

cold-nagai avatar Apr 18 '24 09:04 cold-nagai