Musharraf comments

Results 35 comments of


Musharraf

The ability to save the scanned OCR book as a PDF or Word document, not just a text file

@DraganRatkovich It is possible, of course. But I couldn't see any benefit of those two formats over plain text. No structure information is extracted from the document, except pages and...

Remove page IDs when saving image to text or scanning to text using OCR

Hello @DraganRatkovich I may agree with removing the page numbering, but the page break char is semantically important, specially for OCR results. Anyhow, I'll make text exporting customizable. A dialog...

Remove page IDs when saving image to text or scanning to text using OCR

@DraganRatkovich Yes. the fix is coming.

Feature Request: Add Paddle OCR recognition to Bookworm

Hello @cary-rowen I investigated adding this OCR engine to Bookworm. The main road blocker here is that adding this will increase the bundle size significantly. What does those screen readers...

Feature Request: Add Paddle OCR recognition to Bookworm

Hello @cary-rowen I've been studying paddle OCR and the ways it can be added to Bookworm without bringing in a huge number of additional dependencies. The major issue is that...

Reading aloud takes a long time to start

@TheQuinbox I've some ideas to improve text processing, the least of which is to use a regex to determine the starting position of paragraphs. Alternatively, we can use a speech...

Feature request: Integrate eSpeak NG as internal TTS for Bookworm

Hello @DraganRatkovich Currently investigating the best way to implement this feature. Best Musharraf

Feature request: Integrate eSpeak NG as internal TTS for Bookworm

Hello @cary-rowen eSpeak support, if implemented, will be an optional component, just Like Tesseract or the newly landed Pandoc. Best Musharraf

Ability to highlight the text word by word when reading via TTS

> I don't know if my suggestion is technically possible at this stage (seams complex even to me (smile)), but I'm making it just in case it is or it...

Chinese book content in Mobi format is displayed as garbled characters

@cary-rowen Does this problem happen with the text of the document? or does it only happen with the table-of-content tree view labels. Best Musharraf