pdf-to-txt-python
pdf-to-txt-python copied to clipboard
Simple pdf to text with python using PDFtk and PyPDF2
PDF to Text with Python
Introduction
This program will:
- Split your PDF into pages,
- Extract the text from each pages, and
- Save them in
.txt
file.
Required
Run
$ python main.py <your-pdf-file>
Why Using PDFtk?
Because PyPDF2's extract function doesn't works on some files.