Stefan Weil

163 issues by Stefan Weil

On some page images full of text Tesseract does not detect any text when using the default settings. Typically it prints `Empty page!!` twice for such pages. See issue #3021...

bug
output

Performance is important for real time OCR, mass production OCR and training. In this RFC I'd like to discuss performance bottlenecks and potential improvements. See also the [Tesseract wiki](https://github.com/tesseract-ocr/tesseract/wiki/4.0-Accuracy-and-Performance). According...

performance
OpenMP
SIMD
RFC

### Bug report **What is the problem?** The ramips-mt7621 device ZyXEL NWA50AX does not run reliably in a wireless mesh with 3 such devices. In December 2022 the routers were...

The [documentation](https://github.com/python-pillow/Pillow/blob/main/docs/deprecations.rst#font-size-and-offset-methods) suggests that this old code

```python
width, height = font.getsize("Hello world")
```

should be replaced by

```python
left, top, right, bottom = font.getbbox("Hello world")
width, height = right...
```

Documentation
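For readers unfamiliar with the bounding-box API, here is a minimal sketch of deriving a size from `getbbox`. The helper name `text_size` is hypothetical and not part of Pillow; the subtraction is the usual way to turn a `(left, top, right, bottom)` box into a width and height, and the result is not necessarily identical to what the old `getsize` call returned.

```python
def text_size(font, text):
    """Return (width, height) of the box reported by font.getbbox(text).

    `font` is assumed to be any object with a Pillow-style getbbox()
    method that returns a (left, top, right, bottom) tuple.
    """
    left, top, right, bottom = font.getbbox(text)
    # The box may start at a non-zero offset, so subtract the origin.
    return right - left, bottom - top
```

With Pillow installed, this could be called as `text_size(ImageFont.load_default(), "Hello world")`.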

It might sound strange, but the msys packages work nicely for cross builds (targeting 32 or 64 bit Windows) on Debian / Ubuntu hosts, too. Typically a cross build on...

Is there a need to add support for the TSV format to ocr-fileformat? https://github.com/qurator-spk/page2tsv provides conversion from PAGE XML to TSV. Maybe it is sufficient to know that and use...

enhancement

As a user I don't expect that `make all` will write files to `PREFIX` (which typically requires root permission). Writing such files should only be done with `make install` or...

After running `sudo make install` many local files are owned by root which is not desired:

```
.git/modules/vendor/alto-schema/config
.git/modules/vendor/format-converters/config
.git/modules/vendor/gcv2hocr/config
.git/modules/vendor/hocr-spec-python/config
.git/modules/vendor/hOCR-to-ALTO/config
.git/modules/vendor/im2alto/config
.git/modules/vendor/page-to-alto/config
.git/modules/vendor/xsd-validator/config
.git/modules/vendor/textract2page/config
vendor/page-to-alto/build/lib/ocrd_page_to_alto/__init__.py
vendor/page-to-alto/build/lib/ocrd_page_to_alto/styles.py
vendor/page-to-alto/build/lib/ocrd_page_to_alto/utils.py
vendor/page-to-alto/build/lib/ocrd_page_to_alto/cli.py...
```
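A listing like this can be reproduced with a short script. The following is only a sketch, assuming a POSIX system where root has the numeric user ID 0; the function name `files_owned_by` is hypothetical and not part of the project.

```python
import os


def files_owned_by(root, uid=0):
    """Yield paths below `root` whose owner has the numeric user ID `uid`."""
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            # lstat() inspects a symlink itself rather than its target.
            if os.lstat(path).st_uid == uid:
                yield path
```

Running `sorted(files_owned_by("."))` in the checkout after the install step would list the entries that now belong to root.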