svalbard
A global metadata vault [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ]
The dat archive linked from the README (`de8cb55dcf2bee13b6cf86a6c4619f2368a66ffe0a0b270784bc386fcfa6ee70`) does not have any peers. I assume the correct key is the one from the blog post, which at least has a...
Need a toolchain for monitoring an Internet Archive (IA) collection, extracting the .CDX file manifests for each item on an ongoing basis, and converting the URL lists to NDJSON. Here's my...
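The CDX-to-NDJSON conversion step could look roughly like the sketch below. It assumes the common space-delimited CDX layout, where a header line (`CDX N b a m s ...`) names the columns with single letters. The `cdx_to_ndjson` function and the readable names in `FIELD_NAMES` are illustrative choices, not part of this project. Fetching the manifests from the collection is not shown.

```python
import json

# Assumed mapping from CDX header letters to readable keys (illustrative only).
# Letters without an entry here are kept as-is in the output.
FIELD_NAMES = {
    "N": "urlkey",
    "b": "timestamp",
    "a": "url",
    "m": "mime",
    "s": "status",
    "k": "digest",
    "S": "length",
    "V": "offset",
    "g": "filename",
}


def cdx_to_ndjson(lines):
    """Yield one JSON string per CDX record found in `lines`."""
    fields = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("CDX "):
            # Header line: the letters after "CDX" name the columns.
            fields = line.split()[1:]
            continue
        if fields is None:
            raise ValueError("CDX header line not found before first record")
        values = line.split()
        record = {FIELD_NAMES.get(f, f): v for f, v in zip(fields, values)}
        yield json.dumps(record, sort_keys=True)
```

Each yielded string is one NDJSON line, so the output can be written to a file with one record per line and appended to as new items appear in the collection.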
We should support "datifying" links to the following repositories (based on [this](https://gatesopenresearch.s3.amazonaws.com/resources/Data_Guidelines.pdf)):

### General data

- [Dataverse](https://dataverse.harvard.edu/)
- [Figshare](http://figshare.com/)
- [OSF](https://osf.io/)
- [Zenodo](https://zenodo.org/)

### Humanities and social science data

-...
Managed to get some context by finding https://medium.com/@maxogden/project-svalbard-a-metadata-vault-for-research-data-7088239177ab via Google Search but it is otherwise totally unclear what's going on in this repo. Might want to fix that, I'm assuming...
Lots of government geo data is published on ArcGIS servers, e.g. https://coast.noaa.gov/arcgis/rest/services. A list of servers is available via scraping data.gov and opendata.arcgis.com. I've started working on an exporter that...
I believe @tlevine has prior art here:

- https://github.com/tlevine/socrata-download
- https://github.com/tlevine/socrata-analysis

For this issue, we should be able to continuously tail a Socrata portal and export NDJSON URL metadata for the...
Need a CLI tool that can continuously monitor the main Dataverse instance, as well as any satellite Dataverse instances, and produce NDJSON metadata listings.
Work in progress by @clkao: https://gist.github.com/clkao/cd5975e4a136269f092bb001b95627cd The goal is to be able to continuously monitor an FTP server and produce URL lists with its contents (file names, urls, file sizes, mime...
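As a rough illustration of the URL-list step (not the approach taken in the linked gist), the sketch below turns a directory listing into NDJSON records and emits only entries that were not seen on a previous pass. The `ftp_records` and `new_records` helpers, the record keys, and the fallback MIME type are all assumptions. The listing itself is taken as input, for example from something like `ftplib.FTP.mlsd`, and MIME types are guessed from file extensions only.

```python
import json
import mimetypes
from urllib.parse import quote


def ftp_records(host, entries):
    """Build one metadata record per (path, size_in_bytes) entry."""
    for path, size in entries:
        mime, _ = mimetypes.guess_type(path)
        yield {
            "url": f"ftp://{host}/{quote(path.lstrip('/'))}",
            "name": path.rsplit("/", 1)[-1],
            "size": size,
            # Fallback type is an assumption for files with unknown extensions.
            "mime": mime or "application/octet-stream",
        }


def new_records(records, seen_urls):
    """Yield NDJSON lines for records whose URL is not yet in `seen_urls`."""
    for rec in records:
        if rec["url"] not in seen_urls:
            seen_urls.add(rec["url"])
            yield json.dumps(rec, sort_keys=True)
```

Calling `new_records` repeatedly with the same `seen_urls` set gives a simple form of continuous monitoring: each pass over the server only outputs files that appeared since the last pass. Detecting changed or deleted files would need more state than this sketch keeps.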
NPS IRMA
The metadata is here and needs to be downloaded: https://github.com/ekansa/data-rescue-nps-irma
Same requirements as https://github.com/datproject/svalbard/issues/9 except for DKAN instead of CKAN