spaCy
spaCy copied to clipboard
๐ซ Industrial-strength Natural Language Processing (NLP) in Python
## How to reproduce the behaviour [Colab notebook demonstrating problem](https://colab.research.google.com/drive/14FFYKqjRVRbN7aAVmHUYEao9CwahY0We?usp=sharing) When parsing a sentence that contains newlines, the Italian parser sometimes assigns the newline to a sentence by itself, for...
## How to reproduce the behaviour import spacy nlp = spacy.blank('ml') doc = nlp('เดเดจเตเดคเตเดฏเดฏเดฟเตฝ เดเตเดฐเดณ เดธเดเดธเตเดฅเดพเดจเดคเตเดคเดฟเดฒเตเด เดเตเดจเตเดฆเตเดฐเดญเดฐเดฃเดชเตเดฐเดฆเตเดถเดเตเดเดณเดพเดฏ เดฒเดเตเดทเดฆเตเดตเตเดชเดฟเดฒเตเด เดชเตเดฃเตเดเดฟเดเตเดเตเดฐเดฟเดฏเตเดเต เดญเดพเดเดฎเดพเดฏ เดฎเดพเดนเดฟเดฏเดฟเดฒเตเด เดคเดฎเดฟเดดเตเดจเดพเดเตเดเดฟเดฒเต เดเดจเตเดฏเดพเดเตเดฎเดพเดฐเดฟ เดเดฟเดฒเตเดฒเดฏเดฟเดฒเตเด เดจเตเดฒเดเดฟเดฐเดฟ เดเดฟเดฒเตเดฒเดฏเดฟเดฒเต เดเตเดกเดฒเตเดฒเตเตผ เดคเดพเดฒเตเดเตเดเดฟเดฒเตเด เดธเดเดธเดพเดฐเดฟเดเตเดเดชเตเดชเตเดเตเดจเตเดจ เดญเดพเดทเดฏเดพเดฃเต...
At the time of writing it is not possible to seamlessly access the prediction scores for an Entity Linking model (spaCy 3.4). This impacts the ability to conduct misclassification analyses,...
## How to reproduce the behaviour I originally encountered this issue when attempting to score a simple regex-based span prediction approach as a baseline. My reference documents contain 286 spans...
## How to reproduce the behaviour code: > text1="My name is robert" + " \r\n " * 1000 + "My name is robert" > text2="My name is robert " +...
It seems that while there is support for tokenization with diacritics in spaCy, the project doesn't lemmatize/morph/pos tag correctly when they are used. ## How to reproduce the behaviour ```...
In many cases, I do not see the label at the beginning of a span in displacy. This should not be an issue with a particular label since I sometimes...
Hi, I noticed from #12328 that spaCy has switched to `pymecab-ko` for the Korean tokenizer in the upcoming `spaCy` 4.0, but there seems to be some installation/import issues of this...
### Discussed in https://github.com/explosion/spaCy/discussions/12301 Originally posted by **mbrunecky** February 18, 2023 I am training NER using transformer model. On one of my data sets, during epoch 2, the score reaches...
Even though [TPL_ENT](https://github.com/explosion/spaCy/blob/master/spacy/displacy/templates.py#L51) can be overridden using `template` option of a renderer there is no way to override [TPL_ENTS](https://github.com/explosion/spaCy/blob/2d4fb94ba0a23523cc9adb65e0dcf92bbf6177b6/spacy/displacy/templates.py#L46) template which might be useful in some cases. I believe it...