Paul O'Leary McCann

Results: 50 issues by Paul O'Leary McCann

Consider this fake article: ``` bug demo 0 245544 71636704 69945883 2019-02-12T11:24:39Z Uuu1996 1195068 blah wikitext text/x-wiki [[Category:one]] [[Category:two]] acht3hj6108tk2v3ngv7b05aerdsefc ``` Because of the way the loop that builds the...

Thank you for putting this online, but I could not run the code as provided. I am not sure how you generated requirements.txt. "Python" is not a valid entry there....

## Description This is a simple implementation of a `--dry-run` flag for `spacy download` that makes it print the URL of the package to be installed without downloading anything. I...

enhancement
feat / cli

## Description If you have entities on a Doc, and filter the list to remove some entities, and then set the list of entities, `ent_id` and `ent_kb_id` values are not...

bug
feat / doc
🔜 v4.0

This is for the issue found in https://github.com/explosion/spaCy/discussions/7564. ## How to reproduce the behaviour Given a sentence, set `is_sent_start` to False in some but not all of the tokens before...

bug
feat / parser

## How to reproduce the behaviour Add a PyTorchWrapper component to a language pipeline and then do this: ``` nlp.to_disk("something") nlp2 = spacy.load("something") ``` Motivating case is covered in https://github.com/explosion/spaCy/discussions/8291....

bug
🔮 thinc
feat / serialize
feat / transformer

This adds a training config for the new coref component using OntoNotes / CoNLL 2012 data. The current config works, but is relatively brittle in relation to the OntoNotes setup....

enhancement

Hello. I'm a [spaCy](https://github.com/explosion/spacy) core developer, and we currently use mecab-ko for our Korean language support, but we're not entirely satisfied with it because it requires mecab-ko to be installed...

Status/help wanted

Since development has moved to the Rust version, if this version is not going to be maintained (which seems likely) it might be best to archive it to make it...

While working on the spaCy Japanese model support and integrating Sudachi, I ran into the issue that the one-character ellipsis (`…`) was causing errors. If you tokenize this ellipsis you get...