Similarity metrics based on CMAP: how to represent these in Biolink Model?

Open nlharris opened this issue 3 years ago • 4 comments

Is your feature request related to a problem? Please describe.
@cbizon reported, on behalf of the Molecular Data KP, "the molpro team has some interesting similarity metrics based on CMAP. They compare gene expression profiles with added compound to either gene expressions from gene X KO or amplification. So then they get relations like chemical A has a similar effect on gene expression as knocking out gene B. Or chemical A has a similar effect on gene expression as enhancing gene Q. And on top of that, there is the ability to do all of those calculations for a given set of genes for the gene expression (i.e. some kind of context for the similarity calculation)."

Describe the solution you'd like
We need a way to represent these similarity metrics in Biolink Model.

What working group (or team) did this request originate from?
Molecular Data KP

Additional context
"The actual biology is that (say) the chemical has a similar effect in gene expression as increasing or decreasing another gene. But the increasing/decreasing is important, so just saying chemical-similarto-gene doesn't tell you enough. So how do you represent that? Is it qualifiers on similarity or new predicates, or using some kind of action nodes, or what?" (@cbizon)
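To make the three options in the question above concrete, here is a minimal sketch of how each could look as plain edge records. It is only an illustration: the `Edge` class, the predicate strings, the `perturbation` and `context_genes` fields, the `has_target` link, and all identifiers are hypothetical names chosen for this example, not terms confirmed in Biolink Model.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Edge:
    """A toy subject-predicate-object record (hypothetical, not a Biolink class)."""
    subject: str
    predicate: str
    object: str
    # Hypothetical qualifier: which gene perturbation the chemical resembles.
    perturbation: Optional[str] = None  # e.g. "knockout" or "amplification"
    # Hypothetical context: gene set used for the expression comparison.
    context_genes: Tuple[str, ...] = ()


# Option 1: a generic similarity predicate plus a qualifier for the direction.
qualified = Edge(
    "CHEM:A", "similar_to", "GENE:B",
    perturbation="knockout",
    context_genes=("GENE:X", "GENE:Y"),
)

# Option 2: a new predicate that encodes the direction in its name.
new_predicate = Edge("CHEM:A", "has_similar_expression_effect_as_knockout_of", "GENE:B")

# Option 3: an intermediate "action" node standing for the perturbation itself.
action_node = "ACTION:knockout_of_GENE:B"
via_action = [
    Edge("CHEM:A", "similar_to", action_node),
    Edge(action_node, "has_target", "GENE:B"),
]


def summarize(edge: Edge) -> str:
    """Render a qualified edge (option 1) as a readable statement."""
    direction = edge.perturbation or "unspecified perturbation"
    return f"{edge.subject} has a similar expression effect as {direction} of {edge.object}"


print(summarize(qualified))
```

In this sketch, option 1 keeps a single predicate and pushes the increase/decrease distinction into edge properties, option 2 multiplies predicates, and option 3 adds a node and a second edge per relation.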

Tag relevant members for discussion
@vdancik, @remontoire-pac, @sandrine-m

nlharris, Oct 06 '20 19:10

#503 is a very similar (no pun intended :)) request.

sierra-moxon, Sep 14 '21 18:09

Issue #473 may be raising an additional point that issue #503 does not address: the ability to compute similarity at the gene set level (dealing with gene lists). This discussion could also be linked to the discussion at the past relay about the ability to query at the gene list level.

sandrine-m, Sep 15 '21 00:09

@sandrine-m - we plan on covering this use case at the data modeling team meeting this week (Oct 28), if you have the time to join us. We also documented some possibilities that we'd like feedback on here: https://github.com/biolink/biolink-model/discussions/870

sierra-moxon, Oct 26 '21 21:10

@sierra-moxon Thank you! I'll be there! I will study discussion #870.

sandrine-m, Oct 26 '21 22:10