document-understanding topic
awesome-table-structure-recognition
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.
mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Document-Layout-Analysis
Object Detection Model for Scanned Documents
Checkbox-Detection
Checkbox Detection Model for Scanned Documents
RFUND
Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"
PEneo
[MM'2024] Official implementation of "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction."