pdf-parser topics

PaddleOCR

62.2k

Stars

9.2k

Forks

493

Watchers

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

PaddlePaddle

chineseocr

crnn

db

ocr

pdfalyzer

222

Stars

17

Forks

Watchers

Analyze PDFs. With colors. And Yara.

michelcrypt4d4mus

malicious-pdf-files

malware-analysis

pdf

pdf-documents

nextjs-pdf-parser

46

Stars

5

Forks

Watchers

Next.js template for seamless PDF parsing using pdf2json and FilePond. Ideal for developers seeking a ready-to-use solution for PDF content extraction in Next.js projects.

tuffstuff9

content-extraction

filepond

nextjs

nextjs-pdf

Scanipy stands for "scan it with Python"—it's your smart Python library for scanning and parsing complex PDF files like books, reports, articles, and academic papers. Utilizing cutting-edge Deep Learn...

SVJLucas

deep-learning

ocr

ocr-recognition

pdf