archivespark topic

List archivespark repositories

ArchiveSpark

141
Stars
19
Forks
Watchers

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.