textra
textra copied to clipboard
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Using VisionKit, there's two main ways to get text from images. - [ImageAnalyzer](https://developer.apple.com/documentation/visionkit/imageanalyzer) - [VNRecognizeRequest](https://developer.apple.com/documentation/vision/vnrecognizetextrequest) Based on some OCR tests, I'm seeing that the outputs from these two methods are...
I would love if this could recognize and output structured text! Any plans to implement this? https://developer.apple.com/documentation/vision/structuring_recognized_text_on_a_document
Seems like a great project that would benefit from submission to home-brew?
Apparently on-device speech recognition is only supported on some Mac devices. https://support.apple.com/guide/mac-help/if-dictation-on-mac-doesnt-work-as-expected-mchlc480652b/mac: This issue tracks adding a feature to support speech recognition powered by Apple. Since it would require sending...
It should potentially identify this issue and respond with either an informative error message or resize the image down before processing it
Does this tool work with handwritten text as well, for example handwritten notes or images/scans of handwritten flipcharts?