mupdf.js icon indicating copy to clipboard operation
mupdf.js copied to clipboard

OCR

Open gsemyong opened this issue 1 year ago • 2 comments

Haven't found any reference of OCR in the mupdf.js docs, but see that tesseract is mupdf's optional dependency. Is there an option do OCR using mupdf.js?

gsemyong avatar Apr 13 '24 16:04 gsemyong

There is no option for OCR - this would add considerable megabytes to the codebase, something we can't afford for web runtime. OCR depends on a big and heavy set of libraries and also needs per-language training data files that we need to provide it access to. Possibly at some point we might consider a plugin for OCR, but this would have to be a separate project.

jamie-lemon avatar Apr 15 '24 11:04 jamie-lemon

There is no option for OCR - this would add considerable megabytes to the codebase, something we can't afford for web runtime. OCR depends on a big and heavy set of libraries and also needs per-language training data files that we need to provide it access to. Possibly at some point we might consider a plugin for OCR, but this would have to be a separate project.

WELL, we can! check https://github.com/gutenye/ocr?tab=readme-ov-file#browser

gowy222 avatar May 18 '25 05:05 gowy222