ichiran icon indicating copy to clipboard operation
ichiran copied to clipboard

[feature request] ichiran-cli segment sentence into dictionary forms

Open Sakusuke opened this issue 3 years ago • 2 comments

Requesting a flag which turns the input of a sentence like: "昨日すき焼きを食べました" into "昨日;すき焼き;を;食べる". Needing this for automation with a command line dictionary. -i and -f already kinda do this but getting the dictionary forms from their output is a bit of a hassle and they take quite a bit longer to execute than a normal 'simple-segment' because of the included definitions (atleast that's what I'm guessing). Also this seems like a common enough usecase to warrent a flag.

Sakusuke avatar Oct 05 '21 13:10 Sakusuke

This operation is not well defined, i.e. きました -> くる(来る) or きる(着る). Also seems pretty difficult to implement in general, but I haven't thought about it very hard.

tshatrov avatar Oct 06 '21 08:10 tshatrov

Very good point. The perfect solution would be analyzing context, but since that's way out of scope resorting to frequency would be good enough.

this sentence wouldn't work correctly without the space, but it merely serves as an example of how the flag's output could look like

$ ichiran-cli -foobar 彼は きました
彼
は
来る;着る

Sakusuke avatar Oct 06 '21 14:10 Sakusuke