biolink-model icon indicating copy to clipboard operation
biolink-model copied to clipboard

BioLink Model documentation does not give context to prefixes nor CURIEs

Open cthoyt opened this issue 1 year ago • 2 comments

The documentation page for Gene (and other documentation pages) refer to prefixes and CURIEs without referencing what registry/prefix map these come from, allowing for ambiguity about what each one means.

Specifically about prefixes:

Screenshot 2023-03-09 at 17 36 03 1

Later in the document, with CURIEs

Screenshot 2023-03-09 at 17 36 57

While I personally know what a lot of these prefixes are, there are still several questions:

  1. What's the difference between WB and Wormbase?
  2. What are aspgd and dcid? These don't appear in the Bioregistry.

There should be external references to ensure there is zero ambiguity. I'd be happy to help add any missing prefixes to the Bioregistry to facilitate this.

cthoyt avatar Mar 09 '23 16:03 cthoyt

The prefix expansions are provided with the model; they can be found here: https://github.com/biolink/biolink-model/blob/master/prefix-map/biolink-model-prefix-map.json

balhoff avatar Apr 04 '23 03:04 balhoff

Thanks @balhoff. We're actually using exactly that document as a basis for alignment between BioLink and the Bioregistry. The remaining prefixes we haven't yet been able to contextualize or otherwise handle are listed in https://github.com/biopragmatics/bioregistry/blob/main/src/bioregistry/data/external/biolink/curation.tsv. Note that this contains a lot of references to hash IRIs in OBO PURL space

cthoyt avatar Apr 04 '23 08:04 cthoyt