web-archiving topic

List web-archiving repositories

single-file-cli

505
Stars
53
Forks
Watchers

CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)

pwebarc

22
Stars
0
Forks
Watchers

A suite of tools for mirroring and hoarding web pages you visit for later offline viewing. I.e. your own personal Wayback Machine that can also archive HTTP POST requests and responses, as well as mos...

scrapy-warcio

16
Stars
6
Forks
Watchers

Support for writing WARC files with Scrapy