document-recognition topic
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
tesseract-recognize
Tool that performs layout analysis and/or text recognition using Tesseract and outputs the result in PAGE XML format
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
DocumentReader-web-python-client
Regula Document Reader web API Python 3.5+ client
ID-Card-Recognition
ID card recognition SDK that can recognize ID cards, passports, and driver's licenses from 200+ countries
mistralOCR
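This repository is a document recognition tool based on the Mistral API that supports processing PDF and image files (such as JPG, JPEG, and PNG). It provides a graphical user interface and a command-line interface, can automatically save results in Markdown format, and supports configuration file management and batch file processing.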
OnnxTR
OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing & accessible OCR