InbarShapira

Results 3 issues of InbarShapira

When creating training data for the LSTM model with tesstrain.sh, it is not clear whether I should use --langdata_dir langdata_lstm or --langdata_dir langdata. It affects which eng.training_text file will...

### Bug
There are cases where docling_parser_v2 splits words into their individual characters or joins separate words together.
Example 1:
Original text: products that were recently iroduced
Markdown: products that were re c e...

bug
pdf parsing

### Bug
1. I see cases where I get overlapping clusters, which causes cells to be duplicated.
2. I see cases where cells are assigned to the wrong cluster.

### Steps...

bug
layout