Amit Dovev
Amit Dovev
They published a paper about the table detection module. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.638.7400&rep=rep1&type=pdf
Hi @Sintun! >I'm thinking about writing the API / structure part necessary to hold and access the otherwise lost information (as described by @troplin ). Is the approach described in...
I suggest that you provide an example that demonstrates the use of the new API. Good luck!
>Maybe someone with a deeper understanding of the internals could give a hint? I think only @theraysmith can help here.
>The found table rectangles are already exposed by the api, and **I am not entirely shure yet, that the table structure isn't**. It isn't. That's the whole point of this...
I just did copy and paste of the sentense and it looks okay. ٢١تشرینی یەکەمی ١٩٩٣لە ماڵی محەمەد حەمۆدا کۆ
Which font did you use? It seems to be written in Kurdish.
OK, It's "Unikurd Web".
With the first example at least, it seems that the training text contains a unseen direction character that causes the issue.
https://en.wikipedia.org/wiki/Bidirectional_Text#Table_of_possible_BiDi_character_types