dataportals-registry icon indicating copy to clipboard operation
dataportals-registry copied to clipboard

Add data enrichment from re3data

Open ivbeg opened this issue 1 year ago • 0 comments

Current situation There are a lot of metadata about data catalogs collected in Re3Data scientific data catalog.

Interesting data from re3data:

  • keywords
  • content type
  • contact-email
  • re3data identifier
  • description
  • persistent identifiers systems
  • software
  • versioning
  • institutions
  • repository type

This data could enrich the existing catalog and be added to the entries.

Example Re3Data entry - https://www.re3data.org/repository/r3d100010078

Possible solutions There are a few possible strategies:

  1. Extract whole re3data catalog and extend existing schema and catalog entries automatically as is under re3data attribute. That means high trust on re3data metadata quality.
  2. Manually merge re3data entries with existing catalog entries and to extend existing schema without re3data attribute
  3. Add automatically only re3data identifiers and consider data enrichment later using Re3data identifier-based API.

ivbeg avatar Apr 22 '23 09:04 ivbeg