extract-table topic

List extract-table repositories

pdf2docx

2.2k
Stars
341
Forks
Watchers

Open source Python library for converting PDF to DOCX.

tabula-sharp

142
Stars
22
Forks
Watchers

Extract tables from PDF files (port of tabula-java)

camelot-sharp

31
Stars
5
Forks
Watchers

A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).

ocr-python

74
Stars
11
Forks
Watchers

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.