web-archiving topic

List web-archiving repositories

warc-parquet

100
Stars
0
Forks
Watchers

🗄️ A simple CLI for converting WARC to Parquet.

warrick

84
Stars
10
Forks
Watchers

Recover lost websites from the Web Infrastructure

node-warc

92
Stars
20
Forks
Watchers

Parse And Create Web ARChive (WARC) files with node.js

fatcat

109
Stars
18
Forks
Watchers

Perpetual Access To The Scholarly Record

bookmark-archiver

19
Stars
0
Forks
Watchers

🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...

internet-archiving-talk

47
Stars
5
Forks
Watchers

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

homebrew-archivebox

25
Stars
3
Forks
Watchers

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

cdxj-indexer

21
Stars
11
Forks
Watchers

CDXJ Indexing of WARC/ARCs

wail

119
Stars
9
Forks
Watchers

:whale2: One-Click User Instigated Preservation

MemGator

54
Stars
11
Forks
Watchers

A Memento Aggregator CLI and Server in Go