pymystem3
pymystem3 copied to clipboard
A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projec...
Hi. I want to use this morph analyze for my project, but i get some error. ``` Python 3.7.3 (default, Jun 19 2019, 01:54:03) [GCC 6.3.0 20170516] on linux Type...
pymystem3 = "^0.2.0" Steps to reproduce: 1. Run mystem.lemmatize() on string longer than 65535 chars in celery worker. 2. Get exception ``` (yexception) util/charset/wide.h:295: failed to decode UTF-8 string at...
I found an interesting bug: the same words in a phrase, but with different positions, affect the output  
Bumps [py](https://github.com/pytest-dev/py) from 1.4.34 to 1.10.0. Changelog Sourced from py's changelog. 1.10.0 (2020-12-12) Fix a regular expression DoS vulnerability in the py.path.svnwc SVN blame functionality (CVE-2020-29651) Update vendored apipkg: 1.4...
Hello. I try to launch mystem in docker with multiprocessing and get error: OSError: [Errno 26] Text file busy: '/root/.local/bin/mystem' Dockerfile FROM python:3.8-buster WORKDIR /usr/src/app COPY . . RUN python3.8...
Hi. I have a question. I've got a bunch of large .csv files and parse it via multiprocessing Pool ```python res = [pool.apply_async(func=func, args=(file,)) for file in files] ``` So...
Bumps [pygments](https://github.com/pygments/pygments) from 1.6 to 2.7.4. Release notes Sourced from pygments's releases. 2.7.4 Updated lexers: Apache configurations: Improve handling of malformed tags (#1656) CSS: Add support for variables (#1633, #1666)...
Bumps [jinja2](https://github.com/pallets/jinja) from 2.7.1 to 2.11.3. Release notes Sourced from jinja2's releases. 2.11.3 This contains a fix for a speed issue with the urlize filter. urlize is likely to be...
Due to the new arguments and capabilities in the analyze/lemmatize function