RTX icon indicating copy to clipboard operation
RTX copied to clipboard

Should we ignore the nodes that are deprecated in kg2c build process?

Open chunyuma opened this issue 3 years ago • 3 comments

Based on the KG2.6.1c (http://kg2canonicalized.rtx.ai:7474/browser/) that @amykglen just built, I found that the nodes which are labelled as deprecated in KG2.6.1 are still used in KG2.6.1c. There are total 42,431 deprecated nodes in KG2.6.1. Should we ignore these nodes in KG2c build process?

match (n) where n.deprecated='True' return count(distinct n.id)
count(distinct n.id)
--
42431

Here is one example: In KG2.6.1, we have GO:0075020 which is labelled as deprecated

{
  "iri": "http://purl.obolibrary.org/obo/GO_0075020",
  "synonym": [
    "Ca++ or calmodulin-mediated activation of appressorium formation",
    "Ca2+ or calmodulin-mediated activation of appressorium formation"
  ],
  "category_label": "biological_process",
  "deprecated": "True",
  "name": "obsolete calcium or calmodulin-mediated activation of appressorium formation",
  "description": "Any process that modulates the frequency, rate or extent of symbiont calcium or calmodulin-mediated signal transduction during appressorium formation on or near its host organism. The host is defined as the larger of the organisms involved in a symbiotic interaction. [GOC:pamgo_curators]; OBSOLETE. Any process that modulates the frequency, rate or extent of symbiont calcium or calmodulin-mediated signal transduction during appressorium formation on or near its host organism. The host is defined as the larger of the organisms involved in a symbiotic interaction. // COMMENTS: This term was obsoleted because it represents a GO-CAM model.; UMLS Semantic Type: UMLS_STY:T038",
  "provided_by": "umls_source:GO",
  "id": "GO:0075020",
  "category": "biolink:BiologicalProcess",
  "update_date": "20210201"
}

But in KG2.6.1c, it still exists there.

{
  "iri": "http://purl.obolibrary.org/obo/GO_0075020",
  "expanded_categories": [
    "biolink:BiologicalEntity",
    "biolink:BiologicalProcess",
    "biolink:BiologicalProcessOrActivity",
    "biolink:NamedThing"
  ],
  "name": "obsolete calcium or calmodulin-mediated activation of appressorium formation",
  "description": "Any process that modulates the frequency, rate or extent of symbiont calcium or calmodulin-mediated signal transduction during appressorium formation on or near its host organism. The host is defined as the larger of the organisms involved in a symbiotic interaction. [GOC:pamgo_curators]; OBSOLETE. Any process that modulates the frequency, rate or extent of symbiont calcium or calmodulin-mediated signal transduction during appressorium formation on or near its host organism. The host is defined as the larger of the organisms involved in a symbiotic interaction. // COMMENTS: This term was obsoleted because it represents a GO-CAM model.; UMLS Semantic Type: UMLS_STY:T038",
  "equivalent_curies": [
    "GO:0075020"
  ],
  "id": "GO:0075020",
  "category": "biolink:BiologicalProcess",
  "all_names": [
    "obsolete calcium or calmodulin-mediated activation of appressorium formation"
  ],
  "all_categories": [
    "biolink:BiologicalProcess"

chunyuma avatar Apr 30 '21 04:04 chunyuma