Alexander Nadeau
Alexander Nadeau
Kuromoji now understands the user dictionary at a basic level. This should fix some parses. There's some caveats to how I format the dictionary for it right now (I treat...
I might change Kuromoji over to the Unidic version. The reason I didn't do so before is because it's bigger, but the Unidic parsing is a lot better than the...
The dictionary format kuromoji needs is pre-deconjugated chunks, and with weights to distinguish likely words, so loading edict stuff into it is out. >On another note, running Kuromoji with user...
I rewrote the word splitter again several days ago, and I've used it for long enough that I'm convinced it works the same way it used to. https://github.com/wareya/Spark-Reader/blob/kuromoji/src/language/splitter/WordSplitter.java#L86 The user...
I'm not sure whether it should be MVC or something else, but anything to restructure the program to fix the UI code leakage problem would be a huge plus. I'm...
Maybe splitting it into two different menu items would be a good idea after all. That way the user doesn't have to worry about what spark reader is doing under...
There's also the issue where sometimes a single word has multiple valid definitions but they're way down in the list. This happens a lot with normal words spelled in kana....
>I suppose you're suggesting to provide visible options for storing the first and third, and not deal with the second. Yep, that's right. This is a hard problem, so take...
Sounds good. I'll look at what I have. >Perhaps some sort of option for a more advanced mode could exist, but keeping that compatible with the basic mode... The basic...
I've been thinking about this, and I think I've changed my opinion. Preferred definitions should be based on whatever links them to the definition, which is the deconjugated form. And...