Adriane Boyd comments

Results 347 comments of


                                            Adriane Boyd

Lemmas for Contractions have changed with SpaCy 3.0

We'll plan to have some updates in the v3.1 models. This isn't 100% of the v2.3 lemma exceptions, but covers the most common contractions. You can load the patterns like...

`nlp.pipe(..., n_process>1)` won't return if wrapped by `tqdm()` and `zip()`

Thanks for the report, I can reproduce the behavior where it hangs. As a workaround, I think it works if you wrap `tqdm` around the texts rather than on `zip`:...

`nlp.pipe(..., n_process>1)` won't return if wrapped by `tqdm()` and `zip()`

As a note, I've marked this as a bug because it shouldn't hang like this, but since there's an easy workaround it's going to be pretty low priority for us...

Sentencizer fails with Armenian, Gujarati, and Icelandic

Thanks for the report, it's useful to see these kind of test cases. The underlying reasons are interactions with the tokenizer, which isn't splitting off the sentence-final punctuation into separate...

Allow evaluate CLI to take metrics from the command line arguments

Hi, yes, we definitely need to think about how to make this more flexible for new/custom components. In the meanwhile, the CLI scripts should be pretty easy to copy and...

Adding a transformer model for an existing language.

In terms of adding a new model, we already have similar training corpora set up for the standard Danish models (also DaNE, although possibly an older version) and an internal...

Adding a transformer model for an existing language.

Hi, as a note we'll be adding `da_core_news_trf` for spacy v3.1, initially configured to use `Maltehb/danish-bert-botxo`. For now the pipeline config is basically the same for all `trf` models to...

Training NER models on multiple GPUs (not just one)

The basic `spacy train` training loop only supports one GPU. I think in theory you would want to configure `ray` workers so that each was associated with one particular GPU,...

Training NER models on multiple GPUs (not just one)

If you have created a model with vectors using `spacy init vectors` (the v3 CLI command for this), you then specify it under `[initialize.vectors]` and set `include_static_vectors = true` for...

Training NER models on multiple GPUs (not just one)

Yes, just run [`spacy init vectors`](https://spacy.io/api/cli#init-vectors). The options are in a slightly different format than v2 `spacy init-model`, but it's very similar. Since spacy actually doesn't include any code for...