Results 538 comments of Amit Dovev

https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#error-messages-from-training Try running text2inage with this option: `--output_word_boxes`

>Error in boxClipToRectangle: box outside rectangle >Error in pixScanForForeground: invalid box Error in >boxClipToRectangle: box outside rectangle Error in >pixScanForForeground: invalid box This is [a known bug](https://github.com/tesseract-ocr/tesseract/labels/boxClipToRectangle). It is caused...

>Which part is known? This only refers to the message >Error in boxClipToRectangle ...

Fine tuning might help to detect the missing dot in `No.: `. The wrong overlapping bounding boxes is a bug in the layout analysis phase done by Tesseract. Fine tuning...

The word confidence is the lowest confidence among all symbols in a word.

A symbol confidence of 0 looks odd. It should be interpreted as: "It's looks like there is a symbol there. It's hard to recognize, but here is my best guess".

Use [combine_tessdata](https://github.com/tesseract-ocr/tesseract/blob/f0b8c0254b24a3df7751708f67fa567b4aff2e11/doc/combine_tessdata.1.asc) to extract a traineddata file. Compare your ara/fas config file to the official one.

If the official model works well without a config file and your custom model does not, I don't know what's causing this issue and how it can be solved.

>Are the official best files done by Ray? Yes. For your other questions, I don't know.