spaCy
spaCy copied to clipboard
💫 Industrial-strength Natural Language Processing (NLP) in Python
The workflow for setting up a pipeline component in code sometimes feels a bit rough. This came up while I was investigating #6958. Let's say we have some pipeline component...
Hi Thansk for this nice work. @honnibal I am interested to use any of the pre-trained language model to calculate Perplexity score of a sentence. Is there any way to...
Hi, I'm attempting to initialize the [gpt2-xl huggingface model](https://huggingface.co/gpt2-xl) in SpaCy using the following code provided in [examples/init_model.py](https://github.com/explosion/spacy-transformers/blob/master/examples/init_model.py): ``` #!/usr/bin/env python import plac from wasabi import Printer from spacy_transformers import...
Hi! Is it possible to define a custom loss function when training spacy's transformers? Specifically, I'm asking because the classes in my dataset are imbalanced, and I'd like to weight...
## How to reproduce the behaviour ``` import spacy nlp = spacy.load("en_core_web_sm") list(nlp.tokenizer("about the P&L.")) ``` I get ``` [about, the, P&L.] ``` The . should be separated from P&L...
Would'nt usage of Mmap could really speed up data loading and reduce memory usage in multiprocess environment ? 1. In our webserver we use different process. Each process load three...
With options options = {"fine_grained" : True} it is possible to render the stts-Tags under the Token visualisation. But is it also possible to visualize the stts-Tags and POS Tags...
I'm using the nightly version, I have successfully trained a transformer based NER model and saved it; now I'm trying to resume training on it. Firstly, I'm not sure if...
Hi, I'm using Spacy 2.2.2 to train new tagger and parser models for the Italian language. My training data set is quite big (about 2.3 GB for the train and...
The new Entity Linking code has not been tested/adapted to GPU yet, cf Matt's comment [here](https://github.com/explosion/spaCy/pull/3864#issuecomment-504370194).