wordacc: text stream is too long
When running wordacc on large files in Devanagari script I am getting the error wordacc: text stream is too long.
accuracy program from this repo works fine on the same input files, though the one from earlier version was getting same error - see https://github.com/ryanfb/ancientgreekocr-ocr-evaluation-tools/issues/2
Example files:
It works when used with smaller files, example wordacc report
Hello! Thank you for the bug report. I do not have time to address this right now, but I'll try to have a look at it sometime in mid-March.
ok. Thanks.
@eddieantonio Hi, wordacc: text stream is too long...is this bug solved..if yes, please update the solution..thanks.
Hi @saijaswanth433. Sorry about that! I have not been able to focus attention to work on this project, so this bug is still open >.<
As a workaround, attempt to run this on a smaller input; 1.45 MB of text is way too much!