sbb_textline_detection icon indicating copy to clipboard operation
sbb_textline_detection copied to clipboard

Very good overall performance, but this one fails?

Open jbarth-ubhd opened this issue 4 years ago • 6 comments

Here the original image: https://digi.ub.uni-heidelberg.de/diglitData/v/blaeu1655bd6_-_00_129.tif

here the image fed into sbb-textline (binarized etc): https://digi.ub.uni-heidelberg.de/diglitData/v/blaeu1655bd6_-_00_129-binarized.png

and here the detected segments: grafik

model used:

-rw-r--r-- 1 jb users  458969872 Dec 10  2019 /usr/local/ocrd_models/sbb/textline/model_page_mixed_best.h5
-rw-rw-r-- 1 jb users 1194551551 Feb 13  2020 /usr/local/ocrd_models/sbb/textline/models.tar.gz
-rw-r--r-- 1 jb users  458970960 Jun 26  2019 /usr/local/ocrd_models/sbb/textline/model_strukturerkennung.h5

jbarth-ubhd avatar Oct 14 '20 14:10 jbarth-ubhd

@jbarth-ubhd weird, I have not seen any segmentation results like this coming from the tool. Can you attach the PAGE-XML as well plz?

(cc @vahidrezanezhad)

cneud avatar Oct 14 '20 14:10 cneud

I'll let in through a second time, just to be sure...

jbarth-ubhd avatar Oct 14 '20 14:10 jbarth-ubhd

yes, second try, same, result. Complete ocr-d workflow results:

https://digi.ub.uni-heidelberg.de/diglitData/v/blaeu.tgz

jbarth-ubhd avatar Oct 14 '20 14:10 jbarth-ubhd

Thanks for providing the test data. I can also confirm this via Aletheia. The issue seems to be with the region segmentation - where there are regions detected (see e.g. the marginalia on the left hand side), the textline segmentation actually works ok-ish. We will have a look what's wrong here!

Btw, just in case you missed this recent announcement in the OCR-D Chat:

[...] this is just an ahead notice that SBB/SPK will soon phase out the [development of the] sbb-textline-detector tool in favour of a new version of a layout detection tool which brings numerous improvements, such as e.g.

  • more granular CLI/API access,
  • additional layout elements being detected (marginalia, headlines, initials) and
  • generally much improved performance.

We aim to provide the new tool with OCR-D compliant interfaces via our regular GitHub https://github.com/qurator-spk [asap]

cneud avatar Oct 14 '20 14:10 cneud

I'm looking forward for this! Thanks for inspecting.

jbarth-ubhd avatar Oct 14 '20 14:10 jbarth-ubhd

Dear @jbarth-ubhd, I found some time to investigate this further and with the current version of sbb-textline-detector, while I do get the same problem with the RGB image, when using the binarized image, the segmentation result actually looks fine (see attachments).

Using RGB image rgb

Using binarized image (regions) seg_bin

Using binarizd image (lines) lines_bin

I will also try again with our new (but still work-in-progress) segmentation tool which prefers non-binarized images as input and post results here.

cneud avatar Jan 25 '21 15:01 cneud