pdf-extractor-rag topic

List pdf-extractor-rag repositories

PaddleOCR

66.5k
Stars
9.5k
Forks
66.5k
Watchers

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

MinerU

50.7k
Stars
4.2k
Forks
50.7k
Watchers

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

MinerU-OneClick

17
Stars
3
Forks
17
Watchers

MinerU免安装部署一键启动整合包