Paul O'Leary McCann

Results 233 comments of Paul O'Leary McCann

Since the test went green, I've gone ahead and updated the requirements to use v0.6.0 of `spacy-experimental` instead of the feature branch. The build will break until the release is...

Couple of small things: The dataset parsing (mewsli, wiki parsing) doesn't seem to have outputs defined. Is that hard because they're large collections of files? Is there some placeholder we...

Can we directly provide the output of processing the dataset somehow? If hosting isn't difficult that would be easiest to set up. A more principled thing would be to pick...

I think filtering based on seed terms like that makes sense. > Upload the archives (where to?) and make them publicly available. I think the default for this would usually...

Sorry for the late reply. > I filtered by a list of European capitals now (testing the internally discussed sports teams/city names idea). The filtered Wikidata and Wikipedia dumps are...

Thanks for the heads-up on `python-mecab-ko` and offer the help check! It might take me a bit but I'll see about creating a `fugashi-ko` and check back here when it's...

I never got around to doing this with fugashi, but someone made a package called [pymecab-ko](https://github.com/NoUnique/pymecab-ko) that's like mecab-python3 for mecab-ko, so that might be useful to anyone who was...

@NoUnique Thank you for making pymecab-ko, it's a great project to have! Thank you also for the extra information about custom dictionary usage and the new dictionary release. It's great...

Just want to pop in here and note that neuralcoref is not compatible with spaCy v3 as indicated in https://github.com/huggingface/neuralcoref/issues/295#issuecomment-859005189. We're working on a new coref model with a different...

I used the sample application from [the gunicorn docs](https://docs.gunicorn.org/en/stable/run.html) and imported spaCy at the top and it worked without issue. Could you give example code to reproduce your issue and...