Search icon indicating copy to clipboard operation
Search copied to clipboard

Train and evaluate first RE models for `(GENE, is-in, BRAIN_REGION)`

Open FrancescoCasalegno opened this issue 3 years ago • 2 comments

Context

  • In #606 we collected annotated sentences for the relation extraction (RE) task for the relation type (GENE, is-in, BRAIN_REGION).
  • Effectively, this can be seen as an example of sentence classification.
  • This Issue is now about implementing and evaluating different RE model.

Actions

  • [ ] What is the percentage of sentences that the human expert discarded due to wrong NER annotations? Note that
  • [ ] Out of the annotated samples, what is the percentage of samples that are labelled as "not is-in"?
  • [ ] (Given the previous point:) What is the performance (precision, recall, f1-score) of a naïve RE model that predicts is-in anytime there's a co-mention of a GENE and a BRAIN_REGION in the same unit of text?
  • [ ] What is the performance of a pre-trained NLI model that takes as premise the paragraph and as hypothesis a sentence like [gene] is located in [brain region]?
  • [ ] What is the performance of a pre-trained binary-QA model that takes as context the paragraph and as hypothesis a sentence like Is [gene] located in [brain region]??
  • [ ] What is the performance of a custom trained relation-extraction/sentence-classification model trained on our data? Note that unlike the previous pre-trained and naïve models, in this case we will need a 5-fold cross validation and compute corresponding mean and std for the various performance metrics
  • [ ] Summarize all results in a histogram like the following: Screen Shot 2022-08-16 at 14 52 46

Dependencies

  • Before starting this Issue, #606 must be completed.

FrancescoCasalegno avatar Aug 16 '22 12:08 FrancescoCasalegno

Here are the first results. We used StratifiedKFold with n_splits=5. One potential bias of our custom model (textclassifier) is that we used the the validations sets to do early stopping.

Screenshot 2022-09-06 at 15 52 48 Screenshot 2022-09-06 at 15 53 06 Screenshot 2022-09-06 at 15 56 19

jankrepl avatar Sep 06 '22 13:09 jankrepl

See below results after adding a test set

Screenshot 2022-09-12 at 12 13 33 Screenshot 2022-09-12 at 12 15 29 Screenshot 2022-09-12 at 12 15 37

jankrepl avatar Sep 12 '22 10:09 jankrepl