pdf_downloader
pdf_downloader copied to clipboard
A Scrapy Spider for downloading PDF files from a webpage.
Scrapy PDF Downloader
A Scrapy Spider for downloading PDF files from a webpage.
Installation
- Create a virtualenv - How to create virtualenv
- Activate the virtualenv -
source path/to/bin/activate
- Run
pip install -r requirements.txt
Note: Skip this section if you running using docker
Run
scrapy runspider pdf_downloader.py
scrapy runspider download_humblebundle.py
Run using docker
docker-compose run download
Download Humble Bundle PDF/EPUB
docker-compose run download_humblebundle