Internet Archive

29 repositories owned by Internet Archive

openlibrary

4.9k Stars, 1.2k Forks

One webpage for every book ever published!

bookserver

115 Stars, 19 Forks

Archive.org OPDS Bookserver - A standard for digital book distribution

bookreader

938 Stars, 409 Forks

The Internet Archive BookReader

brozzler

621 Stars, 98 Forks

brozzler - distributed browser-based web crawler

heritrix3

2.7k Stars, 755 Forks

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

fatcat

109 Stars, 18 Forks

Perpetual Access To The Scholarly Record

archive-pdf-tools

82 Stars, 13 Forks

Fast PDF generation and compression. Deals with millions of pages daily.

cdx-summary

47 Stars, 7 Forks

Summarize web archive capture index (CDX) files.