Paul O'Leary McCann
Paul O'Leary McCann
Consider this fake article: ``` bug demo 0 245544 71636704 69945883 2019-02-12T11:24:39Z Uuu1996 1195068 blah wikitext text/x-wiki [[Category:one]] [[Category:two]] acht3hj6108tk2v3ngv7b05aerdsefc ``` Because of the way the loop that builds the...
Thank you for putting this online, but I could not run the code as provided. I am not sure how you generated requirements.txt. "Python" is not a valid entry there....
## Description This is a simple implementation of a `--dry-run` flag for `spacy download` that makes it print the URL of the package to be installed without downloading anything. I...
## Description If you have entities on a Doc, and filter the list to remove some entities, and then set the list of entities, `ent_id` and `ent_kb_id` values are not...
This is for the issue found in https://github.com/explosion/spaCy/discussions/7564. ## How to reproduce the behaviour Given a sentence, set `is_sent_start` to False in some but not all of the tokens before...
## How to reproduce the behaviour Add a PyTorchWrapper component to a language pipeline and then do this: ``` nlp.to_disk("something") nlp2 = spacy.load("something") ``` Motivating case is covered in https://github.com/explosion/spaCy/discussions/8291....
This adds a training config for the new coref component using OntoNotes / CoNLL 2012 data. The current config works, but is relatively brittle in relation to the OntoNotes setup....
Hello. I'm a [spaCy](https://github.com/explosion/spacy) core developer, and we currently use mecab-ko for our Korean language support, but we're not entirely satisfied with it because it requires mecab-ko to be installed...
Since development has moved to the Rust version, if this version is not going to be maintained (which seems likely) it might be best to archive it to make it...
While working on the spaCy Japanese model support and integrating Sudachi, ran into the issue that the one-character ellipsis (`…`) was causing errors. If you tokenize this ellipsis you get...