archivespark topic
List
archivespark repositories
ArchiveSpark
141
Stars
19
Forks
Watchers
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.