document-analysis topic

List document-analysis repositories

AdverseBiNet

38
Stars
9
Forks
Watchers

Improving Document Binarization via Adversarial Noise-Texture Augmentation (ICIP 2019)

Document_Layout_Analysis-MonkAI

26
Stars
6
Forks
Watchers

DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.

assemblyline

194
Stars
13
Forks
Watchers

AssemblyLine 4: File triage and malware analysis

ViBERTgrid-PyTorch

52
Stars
5
Forks
Watchers

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

AdvancedLiterateMachinery

1.4k
Stars
164
Forks
Watchers

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)

Powerful web application that combines Streamlit, LangChain, and Pinecone to simplify document analysis. Powered by OpenAI's GPT-3, RAG enables dynamic, interactive document conversations, making it i...

detectron2-publaynet

46
Stars
6
Forks
Watchers

Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset

amazon-textract-transformer-pipeline

88
Stars
25
Forks
Watchers

Post-process Amazon Textract results with Hugging Face transformer models for document understanding

docvisor

19
Stars
4
Forks
Watchers

An open-source tool for visualisation of outputs of deep-learning models for document analysis tasks such as fully automatic, bounding box and OCR.