Miscellaneous notes from the great snorkeling of 2018

Open dhimmel opened this issue 6 years ago • 3 comments

In Palo Alto.

dhimmel avatar Apr 30 '18 16:04 dhimmel

Monday

  • [x] When using labeling functions to suppress mistagged genes, never return positive evidence: return only 0 (abstain) or -1 (negative). source
  • [x] Make LFS a dictionary mapping each labeling function's name to the function
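
A minimal sketch of both items, assuming the usual Snorkel convention that 1 is positive, -1 is negative, and 0 is abstain. The `Candidate` tuple, the `MISTAGGED_GENE_TERMS` set, and the two labeling functions are hypothetical illustrations, not the functions in our notebooks:

```python
from collections import namedtuple

# Hypothetical stand-in for a candidate: the tagged gene and disease text
# plus the sentence they came from.
Candidate = namedtuple("Candidate", ["gene_text", "disease_text", "sentence"])

# Hypothetical examples of strings that a tagger might mislabel as genes.
MISTAGGED_GENE_TERMS = {"was", "not", "set", "impact"}


def lf_mistagged_gene(c):
    """Vote -1 if the gene mention looks mistagged, otherwise abstain (0)."""
    return -1 if c.gene_text.lower() in MISTAGGED_GENE_TERMS else 0


def lf_gene_too_short(c):
    """Vote -1 for one-character gene mentions, otherwise abstain (0)."""
    return -1 if len(c.gene_text) < 2 else 0


# Dictionary of name to function, so votes can be reported by name.
LFS = {
    "lf_mistagged_gene": lf_mistagged_gene,
    "lf_gene_too_short": lf_gene_too_short,
}


def apply_lfs(candidate, lfs=LFS):
    """Return a {name: vote} dictionary for one candidate."""
    return {name: lf(candidate) for name, lf in lfs.items()}
```

Because the suppression functions can only abstain or vote -1, a well-tagged gene never gains positive evidence from them; positive votes have to come from other labeling functions.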

Issue

The Hetionet labeling function mostly votes 1 rather than -1. Almost all sentences seem to contain a gene and a disease that have a relationship in Hetionet, regardless of whether the sentence attests to that relationship.
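
A rough sketch of why this happens, assuming the labeling function only checks whether the gene and disease pair is in Hetionet. The `HETIONET_PAIRS` set, `lf_hetionet`, and `vote_counts` are hypothetical names, and the pairs are toy placeholders rather than real Hetionet edges:

```python
from collections import Counter

# Toy placeholder pairs; in practice these would be loaded from Hetionet.
HETIONET_PAIRS = {("GENE_A", "disease_x"), ("GENE_B", "disease_y")}


def lf_hetionet(gene, disease, known_pairs=HETIONET_PAIRS):
    """Vote 1 if the pair is a known edge, else -1. The sentence is never read."""
    return 1 if (gene, disease) in known_pairs else -1


def vote_counts(candidates, lf=lf_hetionet):
    """Tally the votes of one labeling function over (gene, disease) candidates."""
    return Counter(lf(gene, disease) for gene, disease in candidates)
```

Tallying votes like this over the candidate set is one way to quantify the imbalance. Since the function never looks at the sentence text, every co-occurrence of a known pair gets a 1.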

dhimmel avatar Apr 30 '18 16:04 dhimmel

Tuesday

  • [x] Scale up to 50k labeled sentences
  • [ ] Consider labeling dev set
  • [ ] Determine how we want label probabilities to be scaled

dhimmel avatar May 01 '18 22:05 dhimmel

Human calls for 100 development sentences

I've gone through 100 sentences, which will be useful for assessing our generative model (consensus/training labels). These are good examples to look at to see why this is a very hard problem. The calls are in sentence-labels-dev.xlsx. CC @danich1
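
One possible way to use the human calls, sketched under the assumption that each sentence has a binary human label (1 or 0) and a generative-model probability. The `precision_recall` helper and the 0.5 threshold are illustrative choices, not something we have settled on:

```python
def precision_recall(human_labels, marginals, threshold=0.5):
    """Compare thresholded model probabilities against binary human calls."""
    predicted = [p > threshold for p in marginals]
    pairs = list(zip(human_labels, predicted))
    tp = sum(1 for y, yhat in pairs if y == 1 and yhat)
    fp = sum(1 for y, yhat in pairs if y == 0 and yhat)
    fn = sum(1 for y, yhat in pairs if y == 1 and not yhat)
    # Guard against empty denominators when nothing is predicted or labeled positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall
```

The threshold matters here, which ties back to the open question of how we want label probabilities to be scaled.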

dhimmel avatar May 04 '18 20:05 dhimmel