devon
Uzbek NLP library
Bumps [wheel](https://github.com/pypa/wheel) from 0.23.0 to 0.38.1. Changelog Sourced from wheel's changelog. Release Notes UNRELEASED Updated vendored packaging to 22.0 0.38.4 (2022-11-09) Fixed PKG-INFO conversion in bdist_wheel mangling UTF-8 header values...
Bumps [pyyaml](https://github.com/yaml/pyyaml) from 3.11 to 5.4. Changelog Sourced from pyyaml's changelog. 5.4 (2021-01-19) yaml/pyyaml#407 -- Build modernization, remove distutils, fix metadata, build wheels, CI to GHA yaml/pyyaml#472 -- Fix for...
Develop a feature to do stemming for Cyrillic words.
Currently, the segmentation algorithm does not recognize the multi-character letters o', g', ch, and sh. Add logic to handle them.
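One possible way to handle these letters is to try the two-character forms before falling back to single characters. The sketch below is only an illustration, not the library's actual code: the function name `split_letters` and the set of accepted apostrophe variants are assumptions.

```python
import re

# Apostrophe-like characters that may follow "o" or "g" in Uzbek Latin text.
# This set is an assumption; adjust it to match the input data.
APOSTROPHES = "'ʻʼ‘’`"

# Try the two-character letters first, then fall back to any single character.
_LETTER_RE = re.compile(
    rf"[og][{re.escape(APOSTROPHES)}]|ch|sh|.",
    re.IGNORECASE | re.DOTALL,
)


def split_letters(word):
    """Split a word into letters, keeping o', g', ch and sh as single units.

    Hypothetical helper for illustration only.
    """
    return _LETTER_RE.findall(word)


if __name__ == "__main__":
    print(split_letters("o'g'il"))
    print(split_letters("shudring"))
```

Listing the two-character alternatives first is what makes the regex prefer them over the single-character fallback. A word such as "is'hoq", where an apostrophe separates s and h, is not merged into "sh" by this pattern.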
Some words cannot be stemmed: bodring, shudring... Make a list of such words.
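A minimal sketch of how such a list could be consulted before stemming. The names `STEM_EXCEPTIONS`, `stem_with_exceptions`, and `toy_stem` are hypothetical, and `toy_stem` is only a stand-in that strips a trailing "ing"; it is not the library's stemmer.

```python
# Words that must be returned unchanged. Only the two examples named in the
# issue are listed here; the real list would be longer.
STEM_EXCEPTIONS = frozenset({"bodring", "shudring"})


def toy_stem(word):
    """Stand-in stemmer (assumption): strips a trailing "ing" if present."""
    return word[:-3] if word.endswith("ing") else word


def stem_with_exceptions(word, stem=toy_stem):
    """Return the word unchanged if it is in the exception list, else stem it."""
    if word.lower() in STEM_EXCEPTIONS:
        return word
    return stem(word)


if __name__ == "__main__":
    for w in ("bodring", "shudring", "kitobing"):
        print(w, "->", stem_with_exceptions(w))
```

Keeping the exceptions in a separate set means new words can be added without touching the stemming rules themselves.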
Apply the same methodology we used for nouns to stem verbs and adjectives.