RTX icon indicating copy to clipboard operation
RTX copied to clipboard

Change how we handle SemMedDB edges in ranking

Open finnagin opened this issue 4 years ago • 2 comments

SemMedDB seems to be returning lots of odd edges and bad results that get pushed higher in rankings.

Lots of potential options to address this:

finnagin avatar Oct 06 '21 20:10 finnagin

Should look into averaging semeddb edge publication counts using:

  • harmonic mean
  • geometric mean
  • median
  • arithmetic mean
  • L-infinity

finnagin avatar Nov 01 '21 17:11 finnagin

On branch issue1695. Should test out the different averaging methods when combining multiple SemMedDB edges and see which ones we like. Issue #1684 is needed for the other items

dkoslicki avatar Jun 05 '22 19:06 dkoslicki

Closing as @mfl15 's approach for filtering SemMedDB will likely fix this issue

dkoslicki avatar Jun 19 '24 16:06 dkoslicki