nltk_data
nltk_data copied to clipboard
NCBI disease corpus
http://www.ncbi.nlm.nih.gov/CBBresearch/Dogan/DISEASE/
NCBI disease corpus is the latest version biomed disease related corpus used for biomedical research, the biocreative_ppi corpus doesn't work currently.
suggested NLTK name as 'ncbidis'
Sorry for the long delay @proline827. Will this corpus work with an existing corpus reader?
@ewan-klein – do you have experience with the biocreative_ppi corpus?
Only in the mists of history :frowning:. There must be other people out there with more up-to-date experience.
Is this available as one of the options? Or all I could do is try and load the corpus with the existing corpus reader?
Is there any update on this?
I'm happy to consider a pull request. Note the file size limit imposed by github.