Nick Sorros

17 comments by Nick Sorros

💯% agree with @ivyleavedtoadflax

I think this requires further discussion. I am adding @dd207 and @aoifespenge as this is a product feature and it may also have consequences for how many citations a researcher...

I assume that this will be addressed by Matt's new model. Are there any quick fixes in the meantime? The only thing I can think of is a binary classifier on...

I am under the impression that 40 characters is a good limit based on Peter's analysis, by which I mean that after 40 there are few observed false positives, meaning...

I think the analysis should be the same as we have done so far, just replicated using the new model. We should aim to do it end to end but...

> Why would we need to label more data @nsorros? Btw I said to @aoifespenge today that I envisage this being another Airflow task that is completed at the end...

Good point, and why not. The data that might need more annotating is the gold data: more titles that are matched to PubMed IDs and have the necessary metadata.