tesseract icon indicating copy to clipboard operation
tesseract copied to clipboard

This might be the fix for the upper disappearing word of issue 3871

Open rmast opened this issue 1 year ago • 3 comments

https://github.com/tesseract-ocr/tesseract/issues/3871

rmast avatar Aug 15 '22 13:08 rmast

This does not work for -c edges_use_new_outline_complexity=1

rmast avatar Aug 16 '22 20:08 rmast

It might be this 'fix' revives the 'rejected parent' from the bucket, while other rejected blobs should probably still be available and are killed with the empty word as well. I think the children are mistrusted, as those blobs come by as well making the row, including the dot on the i. As the new layers of tesseract even know how to revive a parent I guess they could cope better with those mistrusted children as well. I'm trying to get to the right point in the debugger...

rmast avatar Aug 17 '22 09:08 rmast

I changed the strategy of keeping the parent alive to just kill the parent as soon as it is known. That solves the issues on both lines. I wonder whether other examples can reveal a motivation for reviving the parent to a lesser extent.

rmast avatar Aug 19 '22 09:08 rmast