RTX-KG2 icon indicating copy to clipboard operation
RTX-KG2 copied to clipboard

More specific categories for Interleukin 2 nodes (from LOINC/UMLS)

Open amykglen opened this issue 1 year ago • 1 comments

these three nodes have a category of NamedThing in KG2.8.2pre, which prevents them from being merged with the other nodes representing Interleukin 2 that have Protein/Gene or SmallMolecule categories during the synonymizer build:

match (n) where n.id in [
    "LOINC:MTHU015779",
    "LOINC:LP16470-4",
    "MESH:D007376"
  ] return n.id, n.name, n.category, n.description
n.id n.name n.category n.description
"LOINC:MTHU015779" "Interleukin 2" "biolink:NamedThing" "UMLS Semantic Type: STY:T129; UMLS Semantic Type: STY:T116"
"LOINC:LP16470-4" "Interleukin 2" "biolink:NamedThing" "UMLS Semantic Type: STY:T129; UMLS Semantic Type: STY:T116"
"MESH:D007376" "Interleukin-2" "biolink:NamedThing" "A soluble substance elaborated by antigen- or mitogen-stimulated T-LYMPHOCYTES which induces DNA synthesis in naive lymphocytes.; UMLS Semantic Type: STY:T121; UMLS Semantic Type: STY:T129; UMLS Semantic Type: STY:T116"

https://arax.ncats.io/devLM/?term=LOINC:MTHU015779

would it be possible to provide more specific categories for these nodes during the KG2pre build?

amykglen avatar May 26 '23 17:05 amykglen

In the Biolink model, this is how those TUIs are categorized:

  • T129: biological entity (#286)
  • T116: polypeptide
  • T121: drug

One of the ancestors of drug is chemical entity. polypeptide falls under biological entity, but chemical entity and biological entity only share named thing a common ancestor. Thus, the only common ancestor for all three TUIs is named thing. However, this only applies to the MESH term. I will look more into the other two.

ecwood avatar Jun 27 '23 21:06 ecwood