Adriane Boyd

Results 347 comments of Adriane Boyd

We'll plan to have some updates in the v3.1 models. This isn't 100% of the v2.3 lemma exceptions, but covers the most common contractions. You can load the patterns like...

Thanks for the report, I can reproduce the behavior where it hangs. As a workaround, I think it works if you wrap `tqdm` around the texts rather than on `zip`:...

As a note, I've marked this as a bug because it shouldn't hang like this, but since there's an easy workaround it's going to be pretty low priority for us...

Thanks for the report, it's useful to see these kind of test cases. The underlying reasons are interactions with the tokenizer, which isn't splitting off the sentence-final punctuation into separate...

Hi, yes, we definitely need to think about how to make this more flexible for new/custom components. In the meanwhile, the CLI scripts should be pretty easy to copy and...

In terms of adding a new model, we already have similar training corpora set up for the standard Danish models (also DaNE, although possibly an older version) and an internal...

Hi, as a note we'll be adding `da_core_news_trf` for spacy v3.1, initially configured to use `Maltehb/danish-bert-botxo`. For now the pipeline config is basically the same for all `trf` models to...

The basic `spacy train` training loop only supports one GPU. I think in theory you would want to configure `ray` workers so that each was associated with one particular GPU,...

If you have created a model with vectors using `spacy init vectors` (the v3 CLI command for this), you then specify it under `[initialize.vectors]` and set `include_static_vectors = true` for...

Yes, just run [`spacy init vectors`](https://spacy.io/api/cli#init-vectors). The options are in a slightly different format than v2 `spacy init-model`, but it's very similar. Since spacy actually doesn't include any code for...