pdf-extractor-rag topic
pdf-extractor-rag repositories
PaddleOCR
63.9k Stars, 9.3k Forks, 63.9k Watchers
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
MinerU
22.4k Stars, 1.6k Forks
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.