table-extraction topic
PyMuPDF
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.
camelot-sharp
A C# library to extract tabular data from PDFs (a port of the Python library Camelot, built on PdfPig).
img2txt
Easy formatted text extraction from images using the Google Vision API
img2table
img2table is a table identification and extraction Python library for PDFs and images, based on OpenCV image processing
quipucamayoc
dev repo for article
Go5-Project
Extract tabular data from images into Excel files
table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evalu...
pdf2table
PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz
awesome-table-structure-recognition
A curated list of awesome table structure recognition (TSR) research, including models, papers, datasets, and code. Continuously updated.
parsee-pdf-reader
Parsee's PDF reader, specialized in extracting tables with numeric values and in accurately extracting and preserving text paragraphs. Full support for scans and images.