owltools icon indicating copy to clipboard operation
owltools copied to clipboard

Proposal: class utility metric

Open cmungall opened this issue 10 years ago • 3 comments

Classes near the leaves of an ontology provide utility by providing specific classification and disambiguation from other entity types

Classes near the roots of an ontology provide utility in grouping other concepts. However, if the class is not acting as a superclass or other kind of grouping (e.g. partonomic) then it is not providing much utility

A utility metric should take into account both ancestors and descendants. A node should only be penalized if it hangs out near the root AND does not group classes.

A naive metric would be max distance from root plus max distance to farthest leaf

cmungall avatar Nov 03 '15 21:11 cmungall

Honest question here, but how useful is a utility metric without reference to some kind of annotation set?

kltm avatar Nov 03 '15 21:11 kltm

It can be useful for detecting dumb classes like this: http://purl.obolibrary.org/obo/UBERON_0023869

which is dumb regardless of annotation

(will be obsoleted in next release)

annotation IC can be used in place of distance but at some point this blends into http://dx.doi.org/10.1038/nbt0210-128

cmungall avatar Nov 03 '15 22:11 cmungall

It would also be useful for Bgee: we are trying to rank tissues with expression of a given gene, notably based on expression levels. But it is difficult for, e.g., in situ hybridization data. Having such a score would help.

fbastian avatar Nov 04 '15 09:11 fbastian