Robert Knight comments

Results 727 comments of


                                            Robert Knight

ocrs crashes one of my computers but runs fine on a weaker but more modern CPU

Based on discussions elsewhere it seems like a hardware issue that is out of scope for this project to resolve. Per my last comment, I might add a way to...

Add evaluation benchmarks

> What datasets were you considering? I'm currently working on a synthetic data generator. This has the advantage that it can provide coverage of many languages and domains, as long...

> https://github.com/Belval/TextRecognitionDataGenerator also sounds interesting for this. I started with this project. I found that a recognition model trained on output from an unmodified version of it achieves very low...

Add evaluation benchmarks

There are a few dimensions to consider: - Model size: Larger models can store more knowledge/patterns but are slower to execute and use more memory. - Functionality: Some models can...

Is PDF / DOCX support on the roadmap?

Ocrs could potentially integrate with existing libraries or CLI tools for rendering PDFs somehow. It could also serve as a backend for a project like [OCRmyPDF](https://github.com/ocrmypdf/OCRmyPDF). What use case did...

Is PDF / DOCX support on the roadmap?

Rendering a PDF into an image is a complex task which will either involve using platform-specific libraries or compiling large dependencies. Rather than add that into the `ocrs` crate I...

Automatically downscale large input images

Can you provide an image(s) that are representative of the ones that you are trying to extract data from, along with some details of the system you are running the...

Automatically downscale large input images

That single image takes 850 milliseconds on my i5 laptop with `ocrs image.png`. Are you using a release build, or at least building the `rten-*` dependencies in release mode? Debug...

Automatically downscale large input images

> Looking at some of the model code, am I correct in thinking that the expected image size for detection is [800x600](https://github.com/robertknight/ocrs-models/blob/1f75caac40119282420a4df60d23025210019854/ocrs_models/train_detection.py#L22C1-L24C38) and the individual detected lines will be scaled...

Support table

I agree this would be a useful addition in the future. My focus at the moment is basic recognition accuracy and multi-language support. I imagine table detection/extraction would slot in...