Balearica

Results 43 issues of Balearica

Replacing `child_process` with `worker_threads` should yield better performance and a smaller memory footprint in general. Additionally, as images being serialized/deserialized when passed to child processes appears to be the major...

There are currently many complaints regarding slow performance--sometimes the root cause is the .wasm Tesseract engine ([tesseract.js-core](https://github.com/naptha/tesseract.js-core)) while other times the root cause has been a JavaScript function in this...

### Overview **Note: If you are a user experiencing a memory error and just want a fix, skip to the bottom.** In version 2.1.2 a feature was added to auto-rotate...

### Environment * **Tesseract Version**: 5.2.0 * **Commit Number**: 15200c6fe7733f71a6cf52fbc1e4d94150f9f168 * **Platform**: Linux ubuntu 5.15.0-43-generic ### Current Behavior: The legacy engine often fails to recognize words where characters towards the...

legacy

As noted in the [documentation ](https://tesseract-ocr.github.io/tessdoc/ImproveQuality.html#rotation--deskewing), Tesseract performs poorly when the page is at an angle (not a multiple of 90 degrees). This limitation is not problematic from an accuracy...

feature request

There are currently many issues that appear to stem from 2 problems in how caching works at present. 1. We assume that workers are created synchronously, and violating this assumption...

# Overview While bug fixes continue to be released for Version 3, all breaking changes will be released in Version 4, which is currently under development in the branch named...

At present, using a scheduler assumes that all workers are fungible, so any job can be sent to any worker. This means that any tasks that require 2+ "jobs" being...

Tesseract performs extremely poorly when text is at an angle. For example, below is a scan with ~5 degrees of rotation. The first image shows the text Tesseract recognized without...