sssom
sssom copied to clipboard
How to accurately capture curation rules?
Curation rules can be complex, but lets start with the more simple ones we can already represent:
Domain expert decision is the default:
subject_id | relation_id | object_id | match_type | object_source |
---|---|---|---|---|
HP:001 | owl:equivalentTo | MP:001 | sssom:HumanCuration | mp.owl |
The curation rule here is: a domain expert determined or confirmed the equivalence mapping by hand.
Automated tool:
subject_id | relation_id | object_id | match_type | object_source | mapping_tool | subject_match_field | object_match_field |
---|---|---|---|---|---|---|---|
HP:001 | owl:equivalentTo | MP:001 | sssom:LexicalMatch | mp.owl | logmap | skos:exactSynonym | rdfs:label |
The curation rule is: a tool (logmap) determined the equivalence mapping by matching an exact synonym of the subject to a label of the object. (this could be further refined by adding preprocessing information, such as "stemming" to these fields).
@mellybelly and others, please add more concrete curation rules you encounter regularly when creating a mapping.
Just saw we already started a discussion here: https://github.com/mapping-commons/SSSOM/issues/31
Seems closely related to #3.