spaCy
spaCy copied to clipboard
💫 Industrial-strength Natural Language Processing (NLP) in Python
When a [Doc](https://spacy.io/api/doc) object is retokenized, the entity spans in [Doc.ents](https://spacy.io/api/doc#ents) reflect the new token alignment, but the spans in [Doc.spans](https://spacy.io/api/doc#spans) retain the original doc's token indexes, leading to unexpected...
### Discussed in https://github.com/explosion/spaCy/discussions/10861 Using `nlp.rehearse` on a pipelines with a `tok2vec` listener results in `ValueError: [E953] Mismatched IDs`. Originally posted by **nashcaps2255** May 27, 2022 Have a textcat multilabel...
I recently started a new spaCy project and decided not to create a separate Python module with all the custom code. While I can pass a comma-separated list of code...
When running data through the en_core_web_trf model concurrently I am getting different results between runs. I cannot find anywhere in the documentation or other github issues where this behaviour is...
As the span group is meant to support different groups of spans, would be more useful to enable displacy supporting other span keys at the same time? ``` import spacy...
## Description When I ran type checker on the spaCy project, I found that missing argument type error in `create_lemmatizer`. I believe `create_lemmatizer` is no longer used, so I suggest...
I am creating a kb by combining elements of my local graphdb with wikidata. It cannot add a duplicate, but I think it should be able to handle synonyms? so...
Pairs with https://github.com/explosion/spacy-llm/pull/465 over at `spacy-llm`.
## Description I have put this in draft, because it still needs careful review. ### Types of change Docs ## Checklist - [x] I confirm that I have the right...
I had a problem when I used the GPU provided by kaggle to train my Chinese information extraction model, I used the config file generated by the config file generation...