Open-Data-Lab icon indicating copy to clipboard operation
Open-Data-Lab copied to clipboard

Identify datasets for potential inclusion in the ODL

Open Daniel-Mietchen opened this issue 6 years ago • 8 comments

One way to start looking into this would be to check open resources like

  • https://github.com/awesomedata/awesome-public-datasets and see how sustainable/ usable the data are there.

On that basis, we could then decide (see also the inclusion criteria in ODL, as per #18 ) as to whether we'd like to go for datasets scoring high and/or low / average on those scales.

Daniel-Mietchen avatar Oct 24 '18 00:10 Daniel-Mietchen

Another potential candidate: http://retractiondatabase.org/ — described by some as "antediluvian".

Daniel-Mietchen avatar Oct 25 '18 23:10 Daniel-Mietchen

Another one: https://orcid.org/blog/2018/10/24/2018-public-data-file .

Daniel-Mietchen avatar Oct 29 '18 13:10 Daniel-Mietchen

Datasets and code involved in projects for which there is a bug bounty, e.g. https://rubenarslan.github.io/posts/2018-10-26-on-making-mistakes-and-my-bug-bounty-program/ .

Daniel-Mietchen avatar Oct 31 '18 02:10 Daniel-Mietchen

allofplos, as per https://github.com/PLOS/allofplos

Daniel-Mietchen avatar Nov 02 '18 08:11 Daniel-Mietchen

https://doi.org/10.5061%2Fdryad.n5g39d7 - & mdash; probably the most comprehensive public dataset about Hemimastigophora to date

Daniel-Mietchen avatar Nov 18 '18 07:11 Daniel-Mietchen

"Teaching data science with real world datasets" https://twitter.com/emcandre/status/1068139908836012032

Daniel-Mietchen avatar Nov 29 '18 15:11 Daniel-Mietchen

Gaia star catalog data, as per http://sci.esa.int/gaia/60192-gaia-creates-richest-star-map-of-our-galaxy-and-beyond/

Daniel-Mietchen avatar Dec 15 '18 23:12 Daniel-Mietchen

Here is some inspiration from the kinds of data and related services hosted at IDigInfo's data portal:

  • https://idiginfo.org/?q=projects

Daniel-Mietchen avatar Jan 03 '19 15:01 Daniel-Mietchen