Raphael Troncy
Raphael Troncy
When we published the dataset in 2011, we didn't think about putting a license (it was open in our mind). Later on, when we have been asked, we replied that...
It seems to me that the neleval output: ``` ptp fp rtp fn precis recall fscore measure 7 5 7 4 0.583 0.636 0.609 strong_typed_mention_match ``` corresponds to the "Entity...
The Reuters corpus can be obtained without any charges from NIST: http://trec.nist.gov/data/reuters/reuters.html. Another useful resource is of course the [LDC catalog](https://catalog.ldc.upenn.edu/) Take care, there are many CoNLL formats! And 2009...
The CoNLL 2003 dataset is present in numerous github repositories, e.g. in https://github.com/synalp/NER/tree/master/corpus/CoNLL-2003. You can also download and re-built it from https://www.clips.uantwerpen.be/conll2003/ner/.
I do have an unaltered version of the dataset, built from the Reuters CD. Let me know if you need a transfer if you can show that you have signed...
See also this [pull request](https://github.com/anuzzolese/oke-challenge-2016/pull/14) where we have tried to fix numerous cases like this.
Good move @danbri ! In practice, I think that many Linked Data folks that are consuming schema.org annotations are already doing sort of this, programmatically, and an obvious effort to...
I will not be able to review this time. Thanks for the invitation.
@anuzzolese Can we please re-open this issue? This is serious, since nested entities is a very **_hard**_ problem for the community. Fine that the organizers of the challenge want to...
Thanks for having re-opened the issue. For the challenge purpose, I think you should go for your second option, i.e. remove all identification of nested entities, in both the training...