ichiran icon indicating copy to clipboard operation
ichiran copied to clipboard

Linguistic tools for texts in Japanese language

Results 16 ichiran issues
Sort by recently updated
recently updated
newest added

I'm using ichiran-cli with the `-f` argument to provide my scripts with the full segmentation of a sentence. I'm writing my own little parser for the returned JSON, but I'm...

Hiya, I noticed that there doesn't seem to be support for the [verb masu stem] + がい + structure, and ichi.moe instead would try to parse this as everything except...

I'd like to integrate segmentation into an app I wrote. As far as I know, it's not possible to run Lisp on Android. Is there some kind of documentation that...

I noticed that there are some inconsistencies with how whitespace and punctuation are treated, and it causes some precision issues when trying to correlate with the original sentence. For example,...

I'm using Ichiran is because it is, by far, **the best** parser/tokenizer at when it comes to reasonable word boundaries in Japanese. It is so awesome! I have noticed, however,...

Hi, (this doesn't really belong in a bug report but I'd still like to take a second to say that what you've done here is fabulous, amazing, and incredibly helpful....

I'm curious if the maintainers would have any interest in a conversion from Postgresql to SQLite. The idea would be to Ichiran much more operationally simple, allowing it to be...

Requesting a flag which turns the input of a sentence like: "昨日すき焼きを食べました" into "昨日;すき焼き;を;食べる". Needing this for automation with a command line dictionary. -i and -f already kinda do this...

Lately the few placenames etc. that exist in jmdict are being moved to jmnedict. If this continues, ichi.moe won't be able to recognize stuff like Tokyo etc., which is unacceptable....

Hi, I noticed that ichiran will currently segment "どこから見ても" as three separate words "どこ", "から" and "見ても", rather than the expression which has a [JMdict entry](http://www.edrdg.org/jmdictdb/cgi-bin/entr.py?svc=jmdict&sid=&e=2196144). Same is true for...