
Process finished with exit code -1073741819 (0xC0000005)

Open hosford42 opened this issue 4 months ago • 8 comments

How to reproduce the behaviour

import spacy

nlp = spacy.load('en_core_web_lg')
with open('data/1971 Davis Cup.txt', encoding='utf-8') as file:
    for line in file:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        doc = nlp(line)  # the crash happens inside this call
        tokens = [token.text for token in doc]
        # Collect (head index, head's own dependency label, child index) triples.
        dependencies = set()
        for token in doc:
            for child in token.children:
                dependencies.add((token.i, token.dep_, child.i))
        print(tokens, dependencies)

The error occurs during parsing of the 2nd line of the file.
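To pin down exactly which input triggers the crash, a variant of the script above can log each line number before parsing it and turn on Python's standard `faulthandler` module, which dumps the Python-level traceback when the interpreter hits a fatal error. This is only a minimal sketch: the helper names `nonblank_lines`, `run_with_progress`, and `main` are made up for illustration, and the traceback will point at the Python call that was active, not at the native code that faulted.

```python
import faulthandler
import sys
from typing import Callable, Iterable, Iterator, TextIO, Tuple


def nonblank_lines(lines: Iterable[str]) -> Iterator[Tuple[int, str]]:
    """Yield (1-based line number, stripped text) for every non-blank line."""
    for lineno, raw in enumerate(lines, start=1):
        text = raw.strip()
        if text:
            yield lineno, text


def run_with_progress(
    lines: Iterable[str],
    process: Callable[[str], object],
    log: TextIO = sys.stderr,
) -> int:
    """Call process(text) on each non-blank line, logging the line first.

    The log entry is flushed before process() runs, so if the interpreter
    dies inside process(), the last logged line identifies the input.
    Returns the number of lines processed.
    """
    count = 0
    for lineno, text in nonblank_lines(lines):
        print(f"parsing line {lineno}: {text[:60]!r}", file=log, flush=True)
        process(text)
        count += 1
    return count


def main() -> None:
    """Hypothetical entry point using the model and path from this report."""
    import spacy  # imported here so the helpers above work without spaCy

    faulthandler.enable()  # dump the Python traceback on a fatal error
    nlp = spacy.load("en_core_web_lg")
    with open("data/1971 Davis Cup.txt", encoding="utf-8") as file:
        run_with_progress(file, nlp)

# Call main() to run this against the file from the report.
```

The same effect as `faulthandler.enable()` can be had without editing the script by running it with `python -X faulthandler`.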

1971 Davis Cup.txt
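
For reference, the exit code in the title is one value written two ways: -1073741819 is the signed 32-bit reading of 0xC0000005, the Windows status code for an access violation. A small sketch of the conversion (the helper name `to_ntstatus` is made up for illustration):

```python
def to_ntstatus(exit_code: int) -> str:
    """Format a signed 32-bit process exit code as unsigned hexadecimal."""
    # Masking with 0xFFFFFFFF reinterprets the negative value as unsigned.
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


print(to_ntstatus(-1073741819))  # 0xC0000005
```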

Your Environment

  • spaCy version: 3.8.2
  • Platform: Windows-11-10.0.22631-SP0
  • Python version: 3.12.7
  • Pipelines: en_core_web_lg (3.8.0), en_core_web_sm (3.8.0)

Package              Version
-------------------- ---------
annotated-types      0.7.0
beautifulsoup4       4.12.3
blinker              1.8.2
blis                 1.0.1
catalogue            2.0.10
certifi              2024.8.30
charset-normalizer   3.3.2
click                8.1.7
cloudpathlib         0.19.0
colorama             0.4.6
confection           0.1.5
cymem                2.0.8
dash                 2.18.1
dash-core-components 2.0.0
dash-html-components 2.0.0
dash-table           5.0.0
en_core_web_lg       3.8.0
en_core_web_sm       3.8.0
filelock             3.16.1
Flask                3.0.3
fsspec               2024.9.0
gensim               4.3.3
graphviz             0.20.3
idna                 3.10
importlib_metadata   8.5.0
itsdangerous         2.2.0
Jinja2               3.1.4
langcodes            3.4.1
language_data        1.2.0
marisa-trie          1.2.1
markdown-it-py       3.0.0
MarkupSafe           2.1.5
mdurl                0.1.2
mpmath               1.3.0
murmurhash           1.0.10
neo4j                5.25.0
nest-asyncio         1.6.0
networkx             3.4.1
numpy                1.26.4
packaging            24.1
pip                  23.2.1
plotly               5.24.1
preshed              3.0.9
pydantic             2.9.2
pydantic_core        2.23.4
Pygments             2.18.0
pytz                 2024.2
requests             2.32.3
retrying             1.3.4
rich                 13.9.2
scipy                1.13.1
setuptools           75.1.0
shellingham          1.5.4
six                  1.16.0
smart-open           7.0.5
soupsieve            2.6
spacy                3.8.2
spacy-legacy         3.0.12
spacy-loggers        1.0.5
srsly                2.4.8
sympy                1.13.3
tenacity             9.0.0
thinc                8.3.2
torch                2.4.1
tqdm                 4.66.5
typer                0.12.5
typing_extensions    4.12.2
urllib3              2.2.3
wasabi               1.1.3
weasel               0.4.1
Werkzeug             3.0.4
wikipedia            1.4.0
wrapt                1.16.0
zipp                 3.20.2

hosford42, Oct 13 '24 04:10