spaCy issues

Removing (token level) entity information from doc.

2

## How to reproduce the behaviour EntityRuler is run as in example from docs here https://spacy.io/usage/rule-based-matching#entityruler-ent-ids Suppose someone has a series of pipeline components that run after some entities are...

galtay

bug

feat / doc

Add visualisations for parsed documents

1

## Description This will add three visualisations of information from parsed documents: - a table with rows for consecutive tokens in the document; columns are feature values and/or dependency trees...

richardpaulhudson

enhancement

⚠️ wip

feat / visualizers

OOM with a lot of memory untouched

7

## The problem I am training a sentence classification model using a transformer and a pipeline that is based on the default config. I am doing it on the custom...

jakwisn

gpu

perf / memory

feat / transformer

German adjectives ending on `-e` are not lemmatized using the lookup lemmatizer

14

## How to reproduce the behaviour import spacy nlp = spacy.load('de') s1 = 'Der schöne Garten' doc = nlp(s1) [(t, t.lemma_) for t in doc] >> [(Der, 'der'), (schöne, 'schöne'),...

SuzanaK

enhancement

lang / de

help wanted (easy)

feat / lemmatizer

JSON dump error during meta.json dump in case of NaN values for losses

8

## How to reproduce the behaviour I'm trying to train a text classifier and at the first try I always got `OverflowError: Invalid Nan value when encoding double`. Turns out...

samehraban

feat / serialize

I cannot initialise EntityRecognizer object with no examples, like in the docs

1

## How to reproduce the behaviour ``` cfg = {"model": DEFAULT_NER_MODEL} model = registry.resolve(cfg, validate=True)["model"] ner = EntityRecognizer(nlp.vocab, model) ner.initialize(lambda: [], nlp=nlp) ``` The error I get: ``` TypeError: [E930]...

dataqa

docs

feat / ner

EntityRecognizer throws IndexError when used in pipeline with Transformer and custom span getter

2

EntityRecognizer throws IndexError when used in pipeline with Transformer and custom span getter during training: ``` File "/home/---/---/research_spacy_ru/.venv/lib/python3.8/site-packages/spacy/language.py", line 1122, in update proc.update(examples, sgd=None, losses=losses, **component_cfg[name]) File "spacy/pipeline/transition_parser.pyx", line 416,...

tomateit

feat / ner

feat / transformer

setting an extensions attribute in one span changes it in the other

1

## Problem I am working with a two-level NER taxonomy, where I store the first one in `Span.label_` attribute, and the second one in an extension `Span._.type`. I have annotations...

DSLituiev

bug

duplicate

feat / doc

DependencyMatcher fails on sents when tokens have extension attributes set to ents

2

I'm trying to perform relationship extraction between named entities where the named entities span multiple tokens. I've chosen not to merge the entities as that screws up the dependency parsing....

JohnBurant

feat / doc

Wrong lemma in parsed text

1

Version: spaCy 3.2. The lemma for the word "substantially" is "you" for some reason: ``` >>> import spacy >>> nlp = spacy.load("en_core_web_lg") >>> doc=nlp("The opportunity may not be huge for...

gRURgR

bug

lang / en

models

spaCy
spaCy copied to clipboard

Metadata

Removing (token level) entity information from doc.

Add visualisations for parsed documents

OOM with a lot of memory untouched

German adjectives ending on `-e` are not lemmatized using the lookup lemmatizer

JSON dump error during meta.json dump in case of NaN values for losses

I cannot initialise EntityRecognizer object with no examples, like in the docs

EntityRecognizer throws IndexError when used in pipeline with Transformer and custom span getter

setting an extensions attribute in one span changes it in the other

DependencyMatcher fails on sents when tokens have extension attributes set to ents

Wrong lemma in parsed text

← Metadata

Owner

Metadata

spaCy spaCy copied to clipboard

Metadata

← Metadata

Owner

Metadata

spaCy
spaCy copied to clipboard