Erica Wood
Erica Wood
At least one of the UMLS examples is actually correct: `https://identifiers.org/umls:C3341209 UMLS:C3341209 has name inconsistency: Otonyctomys ; Genus Otonyctomys` http://linkedlifedata.com/resource/umls/id/C3341209 lists the correct name as "Otonyctomys". However, for `https://identifiers.org/umls:C3341248 UMLS:C3341248...
The fixes in [c3805bb](https://github.com/RTXteam/RTX-KG2/commit/c3805bbe016e8faec0860d22697bb08694890633) and [c0fb8fa](https://github.com/RTXteam/RTX-KG2/commit/c0fb8fa92a0e137f5ced72e4de440d0d555cccfc) brought the "problem node" count down to 677842. Now, only 17% of the nodes have a different name or category from their previous ETL...
The fixes in [4ce1487](https://github.com/RTXteam/RTX-KG2/commit/4ce14877f41a12dfe7af63c151c6e9b5733b5ebf) brought the "problem node" count down to 666071. Now, only 16.6% of the nodes have a different name or category from their previous ETL equivalent. This...
With the previous few commits ([c3805bb](https://github.com/RTXteam/RTX-KG2/commit/c3805bbe016e8faec0860d22697bb08694890633), [c0fb8fa](https://github.com/RTXteam/RTX-KG2/commit/c0fb8fa92a0e137f5ced72e4de440d0d555cccfc), [4ce1487](https://github.com/RTXteam/RTX-KG2/commit/4ce14877f41a12dfe7af63c151c6e9b5733b5ebf), [91e4e5e](https://github.com/RTXteam/RTX-KG2/commit/91e4e5eb8833f5836d5c4fd6a1c4840cbd39cfce), [29a4f71](https://github.com/RTXteam/RTX-KG2/commit/29a4f7175d924d93a9621ca765563ef0e609558c), [54e8cdc](https://github.com/RTXteam/RTX-KG2/commit/54e8cdc20d39e4ec4629b1a54f318a78016894d0), and [3a993ed](https://github.com/RTXteam/RTX-KG2/commit/3a993ed3b9f3145f83d215b9921353309742d9ce)), here is the updated set of inconsistencies: ``` biolink:Polypeptide---biolink:NamedThing: 111645 biolink:GrossAnatomicalStructure---biolink:AnatomicalEntity: 75418 biolink:Protein---biolink:Polypeptide: 60804 biolink:Gene---biolink:BiologicalEntity: 49759 biolink:NamedThing---biolink:BiologicalEntity:...
When excluding cases where the old category is an abstract class (`biolink:BiologicalEntity` and `biological:InformationContentEntity`, there are 523543 inconsistent nodes (out of 40095349), putting it at around 13%. It seems reasonable...
The next step in verification is to look at edge coverage and description accuracy. @acevedol is taking on this task. The next two steps in this ETL are creating the...
Edges Update: ``` Source predicate curie is missing from the YAML config file: MEDLINEPLUS:PAR Source predicate curie is missing from the YAML config file: NCI:has_data_element Source predicate curie is missing...
This is still a problem as of `KG2.8.3`: ``` match (n) where n.id in ['LOINC:LA14350-5', 'HCPCS:Level%203%3A%20J7318-J7329', 'UMLS:C0280988'] return n.id, n.category, n.name, n.provided_by ``` n.id | n.category | n.name | n.provided_by...
We've already dropped LOINC, so we can check `KG2.8.4` to see if the issue is fixed. But I do think it is likely that will fix the issue.
While looking at "Walloon Language" (from the table, since there are a lot of languages, I thought this might be an easy tree to find a better source for), I...