Jonathan Washington
Jonathan Washington
The amount of memory it uses grows as well. On the first few iterations it uses about 55MB, by 1000 iterations it's up to double that.
> Note that adding the dependency labels / arcs to the out put kaz-tagger and kaz-disam (as currently is the case) breaks translators Is this something that would benefit from...
@khannatanmai, see above about secondary tags.
> Especially if the transducer's not weighted and it will just take one analysis in a greedy manner and go on with that. I don't think there's ever ambiguity with...
I think your examples are okay, though I'm probably not the person to ask. So what do you propose for the two analyses of a form like "оқыған жок"? And...
@mansayk, thank you for sharing your view on this—it's very helpful. I'd just like to clarify one point. You say: > Jumping all the time through the file is not...
Okay, I have a better sense now of what the reasoning is. These are valid reasons, and I've experienced these issues myself. I like Fran's proposal—to keep "open" and "closed"...
> I would suggest to place LEXICON Open in the very end of the file, so it is easier to find where it ends when we sort it. I'm used...
Note, a GCI student wrote a lexc parser and lexicon deduplicator a couple years ago. Let me know if you want help digging it up.
Relevant tools: https://github.com/apertium/apertium-on-github/issues/51