Robert Knight
Based on discussions elsewhere, this seems to be a hardware issue that is out of scope for this project to resolve. Per my last comment, I might add a way to...
> What datasets were you considering? I'm currently working on a synthetic data generator. This has the advantage that it can provide coverage of many languages and domains, as long...
> https://github.com/Belval/TextRecognitionDataGenerator also sounds interesting for this. I started with this project. I found that a recognition model trained on output from an unmodified version of it achieves very low...
There are a few dimensions to consider:
- Model size: Larger models can store more knowledge/patterns but are slower to execute and use more memory.
- Functionality: Some models can...
Ocrs could potentially integrate with existing libraries or CLI tools for rendering PDFs somehow. It could also serve as a backend for a project like [OCRmyPDF](https://github.com/ocrmypdf/OCRmyPDF). What use case did...
Rendering a PDF into an image is a complex task which will either involve using platform-specific libraries or compiling large dependencies. Rather than add that into the `ocrs` crate I...
Can you provide one or more images that are representative of the ones you are trying to extract data from, along with some details of the system you are running the...
That single image takes 850 milliseconds on my i5 laptop with `ocrs image.png`. Are you using a release build, or at least building the `rten-*` dependencies in release mode? Debug...
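If the slowdown comes from unoptimized dependencies, one option is to have Cargo optimize dependencies even in debug builds. A minimal sketch, assuming a standard Cargo project that depends on `ocrs`; the `"*"` override applies to every dependency rather than only the `rten-*` crates, and the `opt-level` value is an illustrative choice:

```toml
# Cargo.toml of the application that depends on ocrs (assumed project layout).

# Keep the application's own code in debug mode for fast rebuilds,
# but compile all dependencies (including the rten-* crates) with optimizations.
[profile.dev.package."*"]
opt-level = 3
```

Alternatively, `cargo build --release` (or `cargo run --release`) builds everything with optimizations.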
> Looking at some of the model code, am I correct in thinking that the expected image size for detection is [800x600](https://github.com/robertknight/ocrs-models/blob/1f75caac40119282420a4df60d23025210019854/ocrs_models/train_detection.py#L22C1-L24C38) and the individual detected lines will be scaled...
I agree this would be a useful addition in the future. My focus at the moment is basic recognition accuracy and multi-language support. I imagine table detection/extraction would slot in...