mondo icon indicating copy to clipboard operation
mondo copied to clipboard

OMIM synonym scope needs review

Open nicolevasilevsky opened this issue 4 years ago • 7 comments

Action items:

  • [ ] 1. most OMIM synonyms should be exact (not related)
  • [ ] 2. most NCIT and GARD syns should be exact too

I think we could do this:

  1. change all OMIM synonyms from related to exact,
  2. then check where there are duplicate synonyms coming from OMIM and manually review these, and then make them related synonyms, if applicable (or maybe broad, in the example from Donna below). (it would probably be good to do this for all synonyms).
  3. (From @maglott) We should do a special review of alternate names for 'disease name 1' because it may be that some of the alternate names are more correctly synonyms for 'disease name'

Related to https://github.com/monarch-initiative/mondo/issues/1794

I just created this and realized it is a duplicate of https://github.com/monarch-initiative/mondo/issues/2255

nicolevasilevsky avatar Feb 05 '21 21:02 nicolevasilevsky

From @maglott on #1794 (Just copying over the comments)

I am a bit skeptical about 'All OMIM synonyms should be exact (not related)'

  • there are cases like https://www.omim.org/entry/602429 and https://www.omim.org/entry/603383 where the same alternate name is used on multiple records.
  • there are cases like the records for gene-specific name 1 where one of the alternate name is likely synonymous with the phenotypic series, not the gene-specific entity.

Are these cases being detected?

nicolevasilevsky avatar Feb 05 '21 21:02 nicolevasilevsky

@nicolevasilevsky Do you think you could take a look at the output file of the OMIM ingest and see if it is satisfactory? Regarding action item (1), I looked and there are ~414~ ~48,974 exactSynonyms and ~158~ 18,358 related synonyms in the output. For action item (2), I don't see 'GARD' or 'NCIT' in the prefix list, so not sure how to check that.

joeflack4 avatar Oct 18 '21 00:10 joeflack4

@nicolevasilevsky please bring this issue to the next QC call, as this requires discussion.

matentzn avatar Oct 18 '21 09:10 matentzn

we can make a blanket assumption that related OMIM syns should be exact but it will need manual review

run query - in branch, we'll see diff and we can click on change in GitHub Desktop and can undo in GH Desktop

nicolevasilevsky avatar Oct 22 '21 16:10 nicolevasilevsky

I still think they should be exact only if unique.

maglott avatar Oct 22 '21 17:10 maglott

I agree @maglott - @nicolevasilevsky we can easily test for uniqueness! Just make sure to remind me when we work on this!

matentzn avatar Oct 22 '21 17:10 matentzn

Action item:

  • [ ] spot check the ttl file (3 classes) and compare to the OMIM record and make sure the exact syns are alternative terms and included terms are related
  • [ ] ask Nico and joe to update the existing Mondo terms

nicolevasilevsky avatar Nov 12 '21 17:11 nicolevasilevsky