pdf-parsing topic
HummusJS
Node.js module for high-performance creation, modification, and parsing of PDF files and streams
pypdf
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
hummusRecipe
A powerful PDF tool for Node.js based on HummusJS.
traprange
(Java) A method to extract tabular content from PDF files
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
py-pdf-parser
A Python tool to help extract information from structured PDFs.
pdf4py
A PDF parser written in Python 3 with no external dependencies.
pdf-table
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
linkedin-pdf-parsing
Parsing resumes in PDF format from LinkedIn
pdf-extractor
Node.js module for rendering PDF pages to images, SVGs, HTML files, text files, and JSON metadata