Adriane Boyd
Thanks, the example is very helpful! We will look into it...
Just a note that editing this setting in `config.cfg` for a trained pipeline won't change anything because these settings are only used on initialization. It will work if you're training...
Sure, a PR would be welcome! The functions would go in this section: https://spacy.io/api/top-level#gold. The source is in `website/docs/api/top-level.md`. Don't be concerned if you can't get the website dev mode...
Sorry this didn't work as expected, and thanks for the suggestion! This issue is kind of low priority on our end right now, but we'll try to come back to...
Hi, it does look like there might be a rule for `e -> er` that's missing from the French lemmatizer rules: https://github.com/explosion/spacy-lookups-data/blob/544a965501f06f55349e7402e80d6a49bc4cb3cd/spacy_lookups_data/data/fr_lemma_rules.json#L79-L125 My French is not that great, so I'm...
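For context, the lemma rules in that file map a word-final suffix to a replacement. As a rough illustration of how such rules are applied, here is a minimal plain-Python stand-in (not spaCy's actual implementation, and the rule list below is invented for the example, not the real French rule set):

```python
# Minimal stand-in for suffix-based lemma rules, in the spirit of the
# fr_lemma_rules.json entries linked above. These rules are illustrative.
VERB_RULES = [
    ("és", "er"),  # e.g. "mangés" -> "manger"
    ("ée", "er"),
    ("e", "er"),   # the kind of rule the report suggests is missing
]

def apply_rules(form: str, rules) -> list:
    """Return every candidate lemma produced by a matching suffix rule."""
    candidates = []
    for old, new in rules:
        if form.endswith(old):
            candidates.append(form[: len(form) - len(old)] + new)
    return candidates

print(apply_rules("mange", VERB_RULES))   # -> ['manger']
print(apply_rules("mangés", VERB_RULES))  # -> ['manger']
```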
There is a lemmatizer cache that would cause this behavior. You can clear it by hand (`nlp.get_pipe("lemmatizer").cache = {}`) or save and reload the pipeline.
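To see why stale entries can persist after you edit the rules, here is a toy stand-in for a lemmatizer with a form-keyed cache (illustrative plain Python, not spaCy's internals):

```python
class CachingLemmatizer:
    """Toy lemmatizer with a cache, mimicking the behavior described above.
    The class and rule data are illustrative, not spaCy's implementation."""

    def __init__(self, rules):
        self.rules = dict(rules)
        self.cache = {}

    def lemmatize(self, form: str) -> str:
        if form in self.cache:           # stale entries survive rule edits
            return self.cache[form]
        lemma = self.rules.get(form, form)
        self.cache[form] = lemma
        return lemma

lemmatizer = CachingLemmatizer({"mange": "mangé"})  # deliberately wrong rule
lemmatizer.lemmatize("mange")         # caches the wrong lemma
lemmatizer.rules["mange"] = "manger"  # fixing the rule alone is not enough...
assert lemmatizer.lemmatize("mange") == "mangé"   # ...the cached value wins
lemmatizer.cache = {}                 # clearing the cache, as suggested above
assert lemmatizer.lemmatize("mange") == "manger"  # now the fix takes effect
```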
Sure, if you'd like to open a PR, please go ahead! We mainly test the lookup lemmatizers in that repo because we don't want to have to...
The rule-based lemmatizer does have a mechanism for checking for forms like infinitives that are already lemmas and don't need to be processed further. There's not currently a check for...
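As a sketch of that short-circuit mechanism, here is a toy version (plain Python with invented data, not spaCy's actual lemmatizer): forms already in the lemma index are returned as-is before any rules run, and rule output is only accepted if it lands on a known lemma.

```python
# Sketch of the "already a lemma" short-circuit described above.
# KNOWN_LEMMAS and RULES are illustrative stand-ins, not spaCy data.
KNOWN_LEMMAS = {"manger", "parler"}
RULES = [("e", "er")]

def lemmatize(form: str) -> str:
    if form in KNOWN_LEMMAS:  # e.g. an infinitive: already a lemma, stop here
        return form
    for old, new in RULES:
        if form.endswith(old):
            candidate = form[: len(form) - len(old)] + new
            if candidate in KNOWN_LEMMAS:  # only accept known lemmas
                return candidate
    return form  # fall back to the surface form

print(lemmatize("manger"))  # short-circuits -> manger
print(lemmatize("mange"))   # rule applies   -> manger
```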
That's a good point! We took the lemma exceptions out of the tokenizer (so the tokenizer is only dealing with tokenization) without moving them to a new component. We can...
Yes, the `attribute_ruler` is the right place to add these exceptions in v3. We will need to add these exceptions to the `attribute_ruler` when we configure the pretrained pipelines for...
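In spaCy v3 the component's API for this is `AttributeRuler.add(patterns, attrs)`, where patterns match tokens and attrs override their attributes. As a runnable illustration of the idea without spaCy installed, here is a toy stand-in (the exception data is invented for the example):

```python
# Toy stand-in for the attribute_ruler exception mechanism described above:
# a pattern matches a token's text, and attrs override its attributes.
# The real component is spaCy's AttributeRuler; this data is illustrative.
exceptions = []

def add_exception(orth: str, attrs: dict) -> None:
    """Register an override for tokens whose text equals `orth`."""
    exceptions.append((orth, attrs))

def apply_exceptions(tokens: list) -> list:
    """Apply every matching exception to each token dict in place."""
    for token in tokens:
        for orth, attrs in exceptions:
            if token["orth"] == orth:
                token.update(attrs)  # e.g. force a specific lemma
    return tokens

add_exception("ain't", {"lemma": "be"})
doc = apply_exceptions([{"orth": "ain't", "lemma": "ain't"}])
print(doc[0]["lemma"])  # -> be
```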