webarchives topic

List webarchives repositories

aut

133
Stars
33
Forks
Watchers

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Squidwarc

164
Stars
26
Forks
Watchers

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

warcworker

53
Stars
9
Forks
Watchers

A dockerized, queued high fidelity web archiver based on Squidwarc

warclight

48
Stars
10
Forks
Watchers

A Rails engine supporting the discovery of web archives.

robustlinks

51
Stars
6
Forks
Watchers

Links on the web break all the time, robustify them!

Seeder

15
Stars
2
Forks
Watchers

Seeder - Czech webarchive curating tool and public site