spaCy icon indicating copy to clipboard operation
spaCy copied to clipboard

💫 Industrial-strength Natural Language Processing (NLP) in Python

Results 210 spaCy issues
Sort by recently updated
recently updated
newest added

## How to reproduce the behaviour When initializing my configuration file, I set my `train` and `dev` paths as follows in my base_config.cfg: ``` train = 'data/train.spacy' dev = 'data/valid.spacy'...

docs
feat / config

## How to reproduce the behaviour ``` import spacy nlp = spacy.load("de_dep_news_trf") assert nlp("Du ißt Äpfel")[1].lemma_ == 'essen' print(nlp("Du isst Äpfel")[1].lemma_) ``` This prints `isst` where `essen` would be expected....

lang / de
feat / lemmatizer

## How to reproduce the behaviour The uniqueness of `Span` is determined based on `(start, end, label_, kb_id_)`. Underscore class stores extensions only based on `(start, end)`. Any thoughts on...

feat / doc

#### Objective To train custom NER on our own dataset using transformers pipeline. We have 15k long documents and have tried different training settings such as `max_length` range -> `128,...

bug
training
feat / transformer

## How to reproduce the behaviour Go to the [Rule-based Matcher Explorer and set a length based rule (included in the link)](https://explosion.ai/demos/matcher?text=A%20match%20is%20a%20tool%20for%20starting%20a%20fire.%20Typically%2C%20modern%20matches%20are%20made%20of%20small%20wooden%20sticks%20or%20stiff%20paper.%20One%20end%20is%20coated%20with%20a%20material%20that%20can%20be%20ignited%20by%20frictional%20heat%20generated%20by%20striking%20the%20match%20against%20a%20suitable%20surface.%20Wooden%20matches%20are%20packaged%20in%20matchboxes%2C%20and%20paper%20matches%20are%20partially%20cut%20into%20rows%20and%20stapled%20into%20matchbooks.&model=en_core_web_sm&pattern=%5B%7B%22id%22%3A2%2C%22attrs%22%3A%5B%7B%22name%22%3A%22LENGTH%22%2C%22value%22%3A%222%22%7D%5D%7D%5D) No matches are shown. I would expect that...

bug
feat / visualizers

The `iob_to_biluo` and `biluo_to_iob` functions are useful but they don't show up in the official docs. Would be nice if they can be updated, I can submit a PR if...

docs
feat / training

I am going through the process of creating a custom ner model after having converted my annotated files to spaCy format. I am using the widget at [the relevant spaCy...

docs

There's most likely a bug in spaCy v2 around the NER resizing, entity rulers in the pipeline, and IO+(re)initialization. Background in this discussion thread: https://github.com/explosion/spaCy/discussions/8864. There are several entity rulers...

bug
feat / ner
feat / serialize
v2

This is for the issue found in https://github.com/explosion/spaCy/discussions/7564. ## How to reproduce the behaviour Given a sentence, set `is_sent_start` to False in some but not all of the tokens before...

bug
feat / parser

## How to reproduce the behaviour The project templates allow [overriding variables](https://spacy.io/usage/projects#custom-scripts) via the CLI as `--vars.foo`, which uses the same mechanism we use for CLI config overrides. Under the...

bug
third-party
feat / cli
projects