Clarification regarding identifier modeling pattern in OBI (and OBO) needed

Open StroemPhi opened this issue 2 years ago • 2 comments

The problem

There are identifiers, e.g. DOI, PubMed ID or specimen identifier, subsumed under IAO:symbol and IAO:centrally registered identifier symbol in OBI, although the classes IAO:identifier and IAO:centrally registered identifier (CRID) also exist. Some OBO ontologies follow OBI, reusing these classes and/or defining new identifiers under IAO:symbol or IAO:CRID symbol, while others use IAO:identifier and IAO:CRID for the same purpose.
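To make the two patterns concrete, here is a minimal Python sketch that sorts classes by which IAO parent they fall under. Everything in it is an assumption for illustration: the toy hierarchy, the `ex:` classes, the placement of DOI, and the subclass links between the IAO classes are hypothetical and not taken from any released ontology. A real survey would load the actual ontology files instead.

```python
from typing import Dict, List, Set

# Toy subclass edges (child -> direct parents). Illustrative assumption only:
# these links and the "ex:" classes are hypothetical, not copied from IAO or OBI.
PARENTS: Dict[str, List[str]] = {
    "IAO:symbol": ["IAO:ICE"],
    "IAO:identifier": ["IAO:ICE"],
    "IAO:centrally registered identifier symbol": ["IAO:symbol"],
    "IAO:centrally registered identifier": ["IAO:identifier"],
    "DOI": ["IAO:centrally registered identifier symbol"],
    "ex:sample barcode": ["IAO:centrally registered identifier"],
    "ex:accession code": ["IAO:ICE"],
}


def ancestors(cls: str, parents: Dict[str, List[str]]) -> Set[str]:
    """Collect all transitive superclasses of cls in the toy hierarchy."""
    seen: Set[str] = set()
    stack = list(parents.get(cls, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen


def modeling_pattern(cls: str, parents: Dict[str, List[str]] = PARENTS) -> str:
    """Name the pattern a class follows, based on its ancestors."""
    anc = ancestors(cls, parents)
    under_symbol = "IAO:symbol" in anc
    under_identifier = "IAO:identifier" in anc
    if under_symbol and under_identifier:
        return "mixed"
    if under_symbol:
        return "symbol pattern"
    if under_identifier:
        return "identifier pattern"
    if "IAO:ICE" in anc:
        return "other ICE"
    return "unclassified"


if __name__ == "__main__":
    for name in ("DOI", "ex:sample barcode", "ex:accession code"):
        print(name, "->", modeling_pattern(name))
```

Running the same check over each ontology's identifier-like classes would give a rough answer to the survey questions below, assuming the subclass hierarchy has been extracted beforehand.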

This discrepancy has historical reasons: IAO:identifier most likely wasn't available when these identifiers were coined in OBI, as it was first defined in PNO using the IAO namespace and was only recently merged into IAO (https://github.com/information-artifact-ontology/IAO/issues/236). I feel a bit bad being the one who started that overdue integration proposal and who now also points to the not yet addressed consequences it entails. But I now know more about previous OBO work and that this issue has been lingering for more than a decade (see the very important context here & here). So, with the pending integration of PNO into IAO, I think we have a great opportunity to harmonize the currently existing practices of modeling identifiers as subclasses of either IAO:symbol or IAO:identifier by making really clear what the difference between IAO:symbol and IAO:identifier is. I already referred to this discrepancy in this now closed IAO issue, where @zhengj2007 suggested that writing an issue here might be best, although it is equally relevant for IAO and other ontologies reusing and/or defining identifiers.

I've tried my best to gather the information I could find regarding this issue (mostly by using OLS and following links in editor notes/comments or GH issues/PRs), which will hopefully allow us to sort things out in a way most or hopefully all can agree on.

Which OBO ontologies follow the OBI pattern, using IAO:symbol as parent for identifiers?

Which OBO ontologies follow the PNO pattern, using IAO:identifier as parent for identifiers?

Which OBO ontologies use a mix of IAO:symbol and IAO:identifier as parents for identifiers?

Which OBO ontologies use identifier-like classes defined directly under IAO:ICE?

Probably not relevant for OBO?

In AFO we can see the "mixup" of what is considered an identifier by looking at how they extend IAO:symbol as well as IAO:identifier. What is most obvious is that they don't incorporate upstream changes regularly (which was confirmed to me via email by the company that is contracted by Allotrope to do the ontology development work).

Other related OBI issues I found that are not yet referenced

  • https://github.com/biobanking/biobanking/issues/88
  • https://github.com/obi-ontology/obi/issues/1246

StroemPhi, Nov 02 '22 16:11

Probably other historically relevant context: the ICBO 2012 presentation at http://icbo.buffalo.edu/Presentations/Ruttenberg.pdf?

  • relevant section on identifiers starting on page 65
  • relevant section on symbols starting on page 96

StroemPhi, Nov 02 '22 21:11

To what extent is Ceusters 2012, "An Information Artifact Ontology Perspective on Data Collections and Associated Representational Artifacts", relevant to this issue? Could "term" and "denotator" be helpful distinctions in the clarification?

StroemPhi, Nov 02 '22 21:11