web-archiving topic
archivebox-browser-extension
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
browsertrix-crawler
Run a high-fidelity browser-based web archiving crawler in a single Docker container
ph-submissions
The repository and website hosting the peer review process for new Programming Historian lessons
auto-archiver
Automatically archive links to videos, images, and social media content from Google Sheets (and more).
browsertrix
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
web-snap
Create "perfect" snapshots of web pages
outbackcdx
Web archive index server based on RocksDB
debian-archivebox
Home of the official apt/deb package for Ubuntu/Debian-based systems.
sandcrawler
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki