RTX-KG2 icon indicating copy to clipboard operation
RTX-KG2 copied to clipboard

document types of edge scores that are currently in KG2

Open saramsey opened this issue 5 years ago • 7 comments

Currently we have:

  • Chembl drug-to-target scores
  • SEMMEDDB scores

are there any others?

Let's report back here.

saramsey avatar Jul 24 '20 20:07 saramsey

I ran: image

And got the following output: image

This suggests that

  • SEMMEDDB
  • HL7
  • MTH
  • MEDLINEPLUS
  • ChemBL Compound

all provide edge scores.

ecwood avatar Jul 28 '20 19:07 ecwood

JensenLab also provides Zscores for their edges

kvarforl avatar Mar 01 '21 23:03 kvarforl

Thank you, @ericawood and @kvarforl. In all cases are the scores in the subject slot of the publication info object underneath the publications_info property?

saramsey avatar Apr 08 '21 00:04 saramsey

Also, both IntAct and DisGeNET have edge scores but due to issue RTXteam/RTX-KG2#23, I didn't store them anywhere (since they're not tied to a particular PMID and it seems a bit better to wait until we get definitive guidance from the EPC WG than go back and potentially redo work later). Both ETLs pull them out into variables so that, when we're ready, we'll have them ready to go.

ecwood avatar Apr 11 '21 03:04 ecwood

If the score is a chi-square score or a p-value, there is already an association slot for it, in the Biolink model:

https://github.com/biolink/biolink-model/blob/8b5ca35ad171f5837ae3b61d3df31e5c3a04b344/biolink-model.yaml#L4665

But, to get it into Neo4j, that would necessitate creating a new column in the edges TSV file.

saramsey avatar Apr 20 '21 20:04 saramsey

Thank you!

saramsey avatar Aug 21 '23 22:08 saramsey

Are we capturing all of these types of edge scores into KG2c? @sundareswarpullela and @ecwood can you please check when you have time? At this point, I am aiming to gather information. Then we can decide about if there are specific types of edge scores we might want to backfill into KG2c. Thank you.

saramsey avatar Aug 21 '23 22:08 saramsey