mpv
mpv copied to clipboard
Is it possible to OCR playing video using OCR backend like Tesseract
I'm trying to make mpv OCR a video to detect subtitles and translate to another window, mostly for watching movies that I don't understand and language learning, but the issue is I can't find API to capture screen to memory, or even more important, capture region to memory (to make OCR more accurate and cost less resource, because smaller = faster ? Is there APIs like that ?
There are already projects that allow you to do this.
It'd be better to translate subtitles beforehand via whisper. https://github.com/abb128/LiveCaptions Also exists.