dinglehopper icon indicating copy to clipboard operation
dinglehopper copied to clipboard

Feature request: status line text with segment IDs

Open bertsky opened this issue 5 years ago • 3 comments

To make navigation in the source annotation easier, show the current TextRegion / TextLine / Word / Glyph ids in the browser's status bar.

bertsky avatar Dec 05 '19 15:12 bertsky

Yes, that would be useful. This feature is somewhat connected to #10 and #5 as I need to do text extraction differently to retain the segment ids.

mikegerber avatar Dec 06 '19 12:12 mikegerber

Or you wait for the PAGE-XML DOM to give us references upwards in the hierarchy. See OCR-D/core#313 and discussion there.

bertsky avatar Dec 06 '19 13:12 bertsky

Latest master now has a tooltip to display the segment id:

image

This is currently the region id for PAGE, and the texline id for ALTO as these are the levels we are currently extracting from. (Upcoming feature #5 will give more options.)

Support to display this for the word differences is also upcoming, until then this issue should stay open.

(This feature took a while because the internal text representation needed some new plumbing.)

mikegerber avatar Oct 08 '20 11:10 mikegerber