entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Dear authors, this repository is such a great resource! Many thanks for creating it. I would like to suggest that maybe the Open Research Knowledge Graph (https://orkg.org/) could be leveraged...
WikiANN: https://elisa-ie.github.io/wikiann/ Here are some more resources: https://www.orkg.org/orkg/comparison/R166240
http://kimi.ml.cmu.edu/transfer/data.tar.gz
I just ran into this list - thanks for putting it up. I curate the GUM corpus included in the data folder, but it seems to be a rather old...
I am trying to convert the NIST-IEER to CoNLL format and see the following error: It looks like it gets through the first 6 files fine but only gets partway...
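For readers unfamiliar with the target format mentioned here, the following is a minimal sketch of reading CoNLL-style data. It assumes the common layout of one token per line, the entity tag in the last whitespace-separated column, and a blank line between sentences. The function name `read_conll` and the sample text are hypothetical and are not taken from the repository's scripts.

```python
from typing import Iterable, List, Tuple

# A sentence is a list of (token, tag) pairs.
Sentence = List[Tuple[str, str]]


def read_conll(lines: Iterable[str]) -> List[Sentence]:
    """Group CoNLL-style lines into sentences of (token, tag) pairs.

    Assumption: the token is the first column and the tag is the last one.
    """
    sentences: List[Sentence] = []
    current: Sentence = []
    for raw in lines:
        line = raw.strip()
        # Blank lines and -DOCSTART- markers end the current sentence.
        if not line or line.startswith("-DOCSTART-"):
            if current:
                sentences.append(current)
                current = []
            continue
        columns = line.split()
        current.append((columns[0], columns[-1]))
    if current:
        sentences.append(current)
    return sentences


# Hypothetical sample, not real corpus data.
sample = """John B-PER
lives O
in O
Boston B-LOC

He O
left O
"""

parsed = read_conll(sample.splitlines())
for sentence in parsed:
    print(sentence)
```

Individual corpora may use extra columns or a different tag scheme, so the column indices would need checking against each dataset.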
Sorry, is there a demo to show how to use your scripts?
[flake8](http://flake8.pycqa.org) testing of https://github.com/juand-r/entity-recognition-datasets on Python 3.7.0

$ __flake8 . --count --select=E901,E999,F821,F822,F823 --show-source --statistics__
```
./data/NIST_IEER/CONLL-format/utils/quick_comma_fix.py:41:37: E999 SyntaxError: invalid syntax
    print annotations
                    ^
./data/NIST_IEER/CONLL-format/utils/makeconll.py:29:30: E999 SyntaxError: invalid syntax
    print category...
```
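The `E999` errors reported here are consistent with Python 2 `print` statements being parsed by Python 3, where `print` is a function. The following is a minimal sketch of that difference, assuming the flagged lines are plain print statements. The values of `annotations` and `category` and the helper `is_valid_python3` are placeholders for illustration, not code from the repository.

```python
# Placeholder values; the real scripts build these from corpus files.
annotations = ["PER", "ORG"]
category = "LOC"


def is_valid_python3(source: str) -> bool:
    """Return True if `source` compiles under the running Python 3 interpreter."""
    try:
        compile(source, "<example>", "exec")
        return True
    except SyntaxError:
        return False


old_style = "print annotations"   # Python 2 statement form
new_style = "print(annotations)"  # Python 3 function form

print(is_valid_python3(old_style))
print(is_valid_python3(new_style))

# The Python 3 equivalents of the two flagged lines:
print(annotations)
print(category)
```

Running a tool such as `2to3` over the affected scripts is one common way to apply this change in bulk, though the output should be reviewed by hand.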
Bumps [scikit-learn](https://github.com/scikit-learn/scikit-learn) from 0.18.2 to 1.5.0. Release notes Sourced from scikit-learn's releases. Scikit-learn 1.5.0 We're happy to announce the 1.5.0 release. You can read the release highlights under https://scikit-learn.org/stable/auto_examples/release_highlights/plot_release_highlights_1_5_0.html and...