Generate and publish full data dumps

Open Abbe98 opened this issue 4 years ago • 2 comments

As a data journalist/archivist, I want to ingest and filter the data in my own tooling.

Task:

Generate data dumps from Snowman's `application/sparql-results+json` cache by converting it to CSV or another format more widely used than SPARQL result sets.
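A minimal sketch of what that conversion could look like in Python, assuming each cache file is a standard SPARQL 1.1 JSON result document for a SELECT query (with `head.vars` and `results.bindings`). The function name `sparql_json_to_csv` and the sample data are made up for illustration and are not part of Snowman.

```python
import csv
import io
import json


def sparql_json_to_csv(sparql_json: str) -> str:
    """Convert a SPARQL 1.1 JSON SELECT result document into CSV text.

    Hypothetical helper, not part of Snowman.
    """
    doc = json.loads(sparql_json)
    # "head.vars" lists the projected variables in query order.
    columns = doc["head"]["vars"]
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(columns)
    for binding in doc["results"]["bindings"]:
        # Unbound variables are absent from a binding, so emit an empty cell.
        writer.writerow([binding.get(col, {}).get("value", "") for col in columns])
    return out.getvalue()


# Made-up sample data standing in for one cached result set.
sample = json.dumps({
    "head": {"vars": ["item", "label"]},
    "results": {"bindings": [
        {"item": {"type": "uri", "value": "http://example.org/1"},
         "label": {"type": "literal", "value": "First, item"}},
        {"item": {"type": "uri", "value": "http://example.org/2"}},
    ]},
})
csv_text = sparql_json_to_csv(sample)
print(csv_text)
```

This keeps only each binding's `value` and drops the datatype and language tag, which may or may not be acceptable for a published dump. ASK results (which carry `boolean` instead of `results`) are not handled.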

Abbe98, Jul 06 '21 12:07

Following support in Snowman for invalidating unused cache items (https://github.com/glaciers-in-archives/snowman/issues/5), we could publish the Snowman cache as our data dump. We might want to rename the files, though, as they are currently named by SHA hashes.
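One way the renaming could work, as a rough sketch: copy the cache files into a separate dump directory under readable names taken from a hand-maintained mapping. Everything here is an assumption: the function `publish_dump`, the idea of a hash-to-name mapping, the `.json` extension, and the example hashes are hypothetical, and the actual cache layout may differ.

```python
import shutil
import tempfile
from pathlib import Path


def publish_dump(cache_dir: Path, out_dir: Path, names: dict[str, str]) -> list[str]:
    """Copy hash-named JSON cache files to out_dir under readable names.

    `names` maps a file's hash (its name without extension) to the readable
    name to publish it under. Hypothetical helper, not part of Snowman.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    published = []
    for cached in sorted(cache_dir.rglob("*.json")):
        # Files whose hash has no entry in the mapping are skipped.
        readable = names.get(cached.stem)
        if readable is None:
            continue
        shutil.copyfile(cached, out_dir / f"{readable}.json")
        published.append(f"{readable}.json")
    return published


# Demo against a throwaway directory with made-up hash names.
with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "cache"
    cache.mkdir()
    (cache / "3a7bd3e2.json").write_text("{}")
    (cache / "9f86d081.json").write_text("{}")
    result = publish_dump(cache, Path(tmp) / "dump", {"3a7bd3e2": "all-items"})
    print(result)
```

Copying rather than renaming in place leaves the cache itself untouched, so the hash-based lookups keep working.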

Abbe98, Oct 22 '21 13:10

The browser extension currently bundles such a dump.

Abbe98, Jan 27 '23 14:01