apertium-kaz
apertium-kaz copied to clipboard
Apertium linguistic data for Kazakh
can I user your program as a stemmer or/and lemmatizer for Kazakh language?
You can merge this at the very end when GSoC period ends (or even later when I finish doing all the requirements).
https://github.com/apertium/apertium-kaz/pull/16 convolutes changes to the Constraint Grammar with the corrections in the UD treebank. To make merging easier/faster, I decided to make a separate PR out of the latter. I...
General context: https://github.com/apertium/apertium-kaz/pull/17 Actually several related issues: 1. `мың` and `миллиард` are `NUM num` everywhere, while `миллион` in some cases is `NUM num`, and in others `NOUN n`. 2. `млрд.`...
Include c → с
Similar to #10, Kazakh has the issue of two `neg.ifi paradigms`. First-person singular (`neg.ifi.p1.sg`) looks like this: - мен барған жоқпын - мен бармадым The question is whether there is...
``` hfst-fst2strings -c 1 .deps/NUM.hfst | gzip -c > NUM.txt.gz Killed make: *** No rule to make target 'NUM-ROMAN.txt.gz', needed by 'all'. Stop. rm .deps/NUM.hfst .deps/NUM.prefix.bin .deps/NUM.prefix.upper .deps/NUM.prefix.att .deps/NUM.prefix.hfst .deps/NUM.prefixes...