archivespark topic
archivespark repositories
ArchiveSpark
141 Stars
19 Forks
An Apache Spark framework for easy data processing, extraction, and derivation for web archives and archival collections, developed at the Internet Archive.