olmocr icon indicating copy to clipboard operation
olmocr copied to clipboard

Describing diagrams and technical manuals

Open garethjax opened this issue 5 days ago • 0 comments

🚀 The feature, motivation and pitch

I've done something pretty rough using python and openai Vision, because i had to parse into text some technical manuals, because i have to instruct a chatbot dedicated to tech support.

The real challenge (which is currently solved by Vision) is that several pages are filled with electric schemas, and instructions, which have to be parsed and properly described.

So i'm wondering if this model will be fine tuned to do something similar?

Alternatives

Openai Vision

Additional context

Image

garethjax avatar Feb 28 '25 07:02 garethjax