Tesseract integration for more languages

Open pramjana opened this issue 1 year ago • 3 comments

Hi,

Hope you are doing well!

Is there a way you can harness the tesseract command line application, which can provide OCR output for many other languages? It is already installed on my system; it just needs to run via a shortcut or be available from the menubar, like your app.

Thank you for your consideration!

Best, Pramod

pramjana avatar Jun 08 '24 11:06 pramjana

It is quite a sizable feature to add and will require a lot of work.

What languages are you looking for specifically?

melonamin avatar Jun 25 '24 14:06 melonamin

Hmm, I didn't realize it would be a lot of work. I was thinking you could run the tesseract command on the captured image in the background and display the result in your app as usual. I'm from India and am looking for OCR for Telugu and Hindi; the language is just a parameter passed to tesseract. The command for Telugu would be:

tesseract input_image.png output_file -l tel
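
A minimal sketch of that idea as a shell helper, under stated assumptions: `ocr_image` and the `TESSERACT_BIN` variable are hypothetical names introduced only for illustration, and this is not how TRex itself works. It relies on two standard tesseract behaviors: passing `stdout` as the output base prints the recognized text instead of writing a file, and language codes can be joined with `+` (for example `tel+hin`), provided the matching language data is installed.

```shell
#!/bin/sh
# Hypothetical helper (not part of TRex): run tesseract on an image and
# print the recognized text to standard output.
#   $1 = path to the image
#   $2 = tesseract language code(s), joined with "+"; defaults to tel+hin
# TESSERACT_BIN is an assumed override for the location of the binary.
ocr_image() {
  image="$1"
  langs="${2:-tel+hin}"
  if [ -z "$image" ]; then
    echo "usage: ocr_image IMAGE [LANGS]" >&2
    return 2
  fi
  # "stdout" as the output base makes tesseract print the text
  # rather than create output_file.txt.
  "${TESSERACT_BIN:-tesseract}" "$image" stdout -l "$langs"
}
```

On macOS the result could then be piped to the clipboard with `ocr_image input_image.png tel | pbcopy`, and `tesseract --list-langs` shows which languages are installed.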

pramjana avatar Jun 27 '24 04:06 pramjana

Yeah, I understand, but I want TRex to be easy to use and work out of the box. I can't ask users to install an additional tool on the side.

So to do it properly I need to take the Tesseract library and integrate it properly. Let me think about it a bit, there are a lot of potential users in India 😄

melonamin avatar Jun 27 '24 19:06 melonamin

@pramjana well, this took a bit longer than I wanted... 😅

Please try the beta build

melonamin avatar Jul 04 '25 16:07 melonamin

Hi @melonamin, it took me long enough to notice this feature was implemented! Sorry, I only just noticed it in the official build and missed the beta. Trying this out now! Thanks so much for implementing this feature!

pramjana avatar Oct 17 '25 04:10 pramjana