Adriane Boyd
Thanks for the note, we'll take a look!
The suggestion for the lemmatizer is included in #12554. As for the poor tagging and similar problems that the statistical models show on tokens with diacritics, I think the best option would be to...
The language and language defaults really need to be set before the pipeline is loaded at all, but you can test this a bit by modifying the pipeline on-the-fly instead....
Thanks for the note! It does look like the package `python-mecab-ko` has had a better set of published wheels since their updates in December. We will evaluate it and consider...
My initial feeling is that `Doc.spans` is a first-class member of the doc and the retokenization should also apply here. I'm not sure how complicated the implementation would be at this...
Spans are trickier than ents because ents have some concept of head/phrase that can be used to decide how to adjust the entity boundaries. My initial sketch of how this...
I can reproduce this, but it's probably related to `torch` rather than `spacy` directly and I'm not as sure about what might be going on in torch that would cause...
My feeling is that it's fine to make it more configurable, but for our current use of wasabi it makes sense for the default to stay stdout. In many cases,...
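To illustrate the "configurable, but stdout by default" idea, here is a minimal sketch. `SketchPrinter` and its `file` parameter are hypothetical names made up for this example and are not part of wasabi's API; the sketch only assumes that the output stream is an ordinary Python text stream.

```python
import io
import sys
from typing import Optional, TextIO


class SketchPrinter:
    """Hypothetical printer used for illustration only (not wasabi's API)."""

    def __init__(self, file: Optional[TextIO] = None) -> None:
        # None means "use whatever sys.stdout is when the message is printed"
        self._file = file

    def text(self, message: str) -> None:
        # Resolve stdout at call time so later redirection of sys.stdout
        # (e.g. by a test runner) is still respected
        stream = self._file if self._file is not None else sys.stdout
        print(message, file=stream)


# Default behaviour: output goes to stdout
SketchPrinter().text("goes to stdout")

# Opt-in configuration: send output to any other text stream
buffer = io.StringIO()
SketchPrinter(file=buffer).text("captured")
captured = buffer.getvalue()
```

Resolving `sys.stdout` at call time rather than in the default argument keeps the default unchanged for existing callers while still letting them pass `sys.stderr` or a file object explicitly.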
Thanks for the pointer! To a large extent `srsly` is developed for the formats we're using for Explosion packages/projects and my initial feeling is that we're unlikely to replace `msgpack-numpy`...
The `spacy-streamlit` examples are intended to be run outside a notebook with

```shell
streamlit run ...
```

The concrete problem above is that `displacy.render` auto-detects that you're in a Jupyter...
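As a rough illustration of the auto-detection behaviour, `displacy.render` accepts a `jupyter` argument, and passing `jupyter=False` makes it return the HTML markup as a string instead of displaying it inline. The sketch below is not necessarily the fix suggested in this thread; the blank English pipeline, the example sentence, and the manually set `PERSON` entity are assumptions chosen so that no trained model is needed.

```python
import spacy
from spacy import displacy
from spacy.tokens import Span

# Blank pipeline: tokenization only, no trained model required (assumption
# made to keep the example self-contained)
nlp = spacy.blank("en")
doc = nlp("Adriane Boyd works on spaCy.")

# Set an entity by hand, since a blank pipeline predicts none
doc.ents = [Span(doc, 0, 2, label="PERSON")]

# jupyter=False overrides the notebook auto-detection and returns the markup
html = displacy.render(doc, style="ent", jupyter=False)
```

The returned string can then be handed to whatever is doing the rendering, rather than relying on inline notebook display.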