klpt
klpt copied to clipboard
The Kurdish Language Processing Toolkit
Note that there is a treebank for Kurmanji at https://github.com/UniversalDependencies/UD_Kurmanji-MG this can be used for training a part of speech tagger and dependency parser.
The `Configuration` module needs to be properly integrated into each single function in all other modules. This is not the case for `Stem` module, for instance. https://github.com/sinaahmadi/klpt/blob/d540c52a7cf86e249ab0c9cc9f6d7b524e31d6a9/klpt/configuration.py#L18
whatever i do cyhunspell fails i wonder if you found a way to work it out
In Kurmanji, words ending with "î" when inflected with a form starting with "î" undergo an alternation where the "îî" becomes "iy" in contrast to "îy". That should be included...
The argument `in_separator` seems to be overwritten at the end of the mwe_tokenize function. The argument is not accessible through `word_tokenizer` either.