document-parser topic

List document-parser repositories

opencv-text-deskew

49
Stars
16
Forks
Watchers

Tutorial on how to deskew (straighten) text images

unstructured

6.8k
Stars
519
Forks
43
Watchers

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

smart-docs-parser

21
Stars
7
Forks
Watchers

An OCR based document parser to extract information from identity document images

papercast

32
Stars
1
Forks
Watchers

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...

marie-ai

45
Stars
2
Forks
Watchers

Integrate AI-powered Document Analysis Pipelines

ragflow

7.7k
Stars
685
Forks
47
Watchers

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

semantic-ai

17
Stars
1
Forks
Watchers

An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate human readable conversational response with the help of LLM (Lar...