spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Results 210 spaCy issues

## How to reproduce the behaviour [Colab notebook demonstrating problem](https://colab.research.google.com/drive/14FFYKqjRVRbN7aAVmHUYEao9CwahY0We?usp=sharing) When parsing a sentence that contains newlines, the Italian parser sometimes assigns the newline to a sentence by itself, for...

lang / it
feat / senter

## How to reproduce the behaviour import spacy nlp = spacy.blank('ml') doc = nlp('ഇന്ത്യയിൽ കേരള സംസ്ഥാനത്തിലും കേന്ദ്രഭരണപ്രദേശങ്ങളായ ലക്ഷദ്വീപിലും പോണ്ടിച്ചേരിയുടെ ഭാഗമായ മാഹിയിലും തമിഴ്നാട്ടിലെ കന്യാകുമാരി ജില്ലയിലും നീലഗിരി ജില്ലയിലെ ഗൂഡല്ലൂർ താലൂക്കിലും സംസാരിക്കപ്പെടുന്ന ഭാഷയാണ്...

bug
lang / ml

At the time of writing it is not possible to seamlessly access the prediction scores for an Entity Linking model (spaCy 3.4). This impacts the ability to conduct misclassification analyses,...

enhancement
feat / nel

## How to reproduce the behaviour I originally encountered this issue when attempting to score a simple regex-based span prediction approach as a baseline. My reference documents contain 286 spans...

feat / scorer

## How to reproduce the behaviour code: > text1="My name is robert" + " \r\n " * 1000 + "My name is robert" > text2="My name is robert " +...

perf / accuracy

It seems that while there is support for tokenization with diacritics in spaCy, the project doesn't lemmatize/morph/pos tag correctly when they are used. ## How to reproduce the behaviour ```...

lang / ru
lang / uk

In many cases, I do not see the label at the beginning of a span in displacy. This should not be an issue with a particular label since I sometimes...

feat / visualizers

Hi, I noticed from #12328 that spaCy has switched to `pymecab-ko` for the Korean tokenizer in the upcoming `spaCy` 4.0, but there seem to be some installation/import issues with this...

lang / ko

### Discussed in https://github.com/explosion/spaCy/discussions/12301 Originally posted by **mbrunecky** February 18, 2023 I am training NER using a transformer model. On one of my data sets, during epoch 2, the score reaches...

bug
feat / ner
perf / memory
feat / training
feat / transformer

Even though [TPL_ENT](https://github.com/explosion/spaCy/blob/master/spacy/displacy/templates.py#L51) can be overridden using the `template` option of a renderer, there is no way to override the [TPL_ENTS](https://github.com/explosion/spaCy/blob/2d4fb94ba0a23523cc9adb65e0dcf92bbf6177b6/spacy/displacy/templates.py#L46) template, which might be useful in some cases. I believe it...

enhancement
feat / visualizers