document-parser topic

List document-parser repositories

unstructured

8.6k
Stars
702
Forks
43
Watchers

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

smart-docs-parser

21
Stars
7
Forks
Watchers

An OCR based document parser to extract information from identity document images

papercast

32
Stars
1
Forks
Watchers

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...

marie-ai

59
Stars
5
Forks
Watchers

Integrate AI-powered Document Analysis Pipelines

ragflow

56.8k
Stars
5.6k
Forks
263
Watchers

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

semantic-ai

18
Stars
1
Forks
Watchers

An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate human readable conversational response with the help of LLM (Lar...

Invoiceable

23
Stars
2
Forks
Watchers

The invoice, document, and résumé parser powered by AI.