Musharraf

Results 35 comments of Musharraf

@DraganRatkovich It is possible, of course. But I couldn't see any benefit of those two formats over plain text. No structure information is extracted from the document, except pages and...

Hello @DraganRatkovich I may agree with removing the page numbering, but the page break char is semantically important, specially for OCR results. Anyhow, I'll make text exporting customizable. A dialog...

Hello @cary-rowen I investigated adding this OCR engine to Bookworm. The main road blocker here is that adding this will increase the bundle size significantly. What does those screen readers...

Hello @cary-rowen I've been studying paddle OCR and the ways it can be added to Bookworm without bringing in a huge number of additional dependencies. The major issue is that...

@TheQuinbox I've some ideas to improve text processing, the least of which is to use a regex to determine the starting position of paragraphs. Alternatively, we can use a speech...

Hello @DraganRatkovich Currently investigating the best way to implement this feature. Best Musharraf

Hello @cary-rowen eSpeak support, if implemented, will be an optional component, just Like Tesseract or the newly landed Pandoc. Best Musharraf

> I don't know if my suggestion is technically possible at this stage (seams complex even to me (smile)), but I'm making it just in case it is or it...

@cary-rowen Does this problem happen with the text of the document? or does it only happen with the table-of-content tree view labels. Best Musharraf