pdf-extractor topic

List pdf-extractor repositories

pdfsam

3.1k
Stars
317
Forks
Watchers

PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages

doc_crawler.py

20
Stars
7
Forks
Watchers

Explore a website recursively and download all the wanted documents (PDF, ODT…)

PdfPig

1.5k
Stars
220
Forks
Watchers

Read and extract text and other content from PDFs in C# (port of PDFBox)

docnet

430
Stars
88
Forks
Watchers

DocNET is as fast PDF editing and reading library for modern .NET applications

python-pdftables-api

79
Stars
31
Forks
Watchers

Python library to interact with https://pdftables.com API

pdf-to-txt-python

19
Stars
13
Forks
Watchers

Simple pdf to text with python using PDFtk and PyPDF2