Jonathan Washington

239 comments by Jonathan Washington

The amount of memory it uses grows as well. On the first few iterations it uses about 55 MB; by 1000 iterations it's up to double that.

> Note that adding the dependency labels / arcs to the output of kaz-tagger and kaz-disam (as is currently the case) breaks translators. Is this something that would benefit from...

> Especially if the transducer's not weighted and it will just take one analysis in a greedy manner and go on with that. I don't think there's ever ambiguity with...

I think your examples are okay, though I'm probably not the person to ask. So what do you propose for the two analyses of a form like "оқыған жоқ"? And...

@mansayk, thank you for sharing your view on this—it's very helpful. I'd just like to clarify one point. You say: > Jumping all the time through the file is not...

Okay, I have a better sense now of what the reasoning is. These are valid reasons, and I've experienced these issues myself. I like Fran's proposal—to keep "open" and "closed"...

> I would suggest placing LEXICON Open at the very end of the file, so it is easier to find where it ends when we sort it. I'm used...

Note, a GCI student wrote a lexc parser and lexicon deduplicator a couple years ago. Let me know if you want help digging it up.

Relevant tools: https://github.com/apertium/apertium-on-github/issues/51