
When building a user dict, check number of fields

Open • polm opened this issue 2 years ago • 0 comments

Originally brought up in https://github.com/octanove/janlpbook-code/issues/1.

When building a user dictionary, it would be helpful to check the number of fields and emit a warning when it doesn't match the system dictionary. This is a little tricky because MeCab doesn't actually parse any dictionary fields, so it's valid to just omit all of them, for example. It's further complicated by the fact that dictionaries don't have explicit schemas. So this would require making some assumptions and departing from the MeCab API, but it would help people build more useful user dictionaries.
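A rough sketch of what such a check might look like, assuming the system dictionary's source CSV is available and that its most common row length can stand in for the expected field count (this is a guess, since there is no schema). The names `field_counts` and `check_user_dict` are hypothetical and not part of fugashi or MeCab, and the sample rows are made up for illustration.

```python
import csv
import io
from collections import Counter


def field_counts(csv_text):
    """Count how many non-empty rows have each number of fields."""
    reader = csv.reader(io.StringIO(csv_text))
    return Counter(len(row) for row in reader if row)


def check_user_dict(user_csv_text, expected_fields):
    """Return (row_number, field_count) for rows that don't match.

    expected_fields is an assumption inferred by the caller, since
    MeCab dictionaries carry no explicit schema.
    """
    reader = csv.reader(io.StringIO(user_csv_text))
    mismatches = []
    for row_number, row in enumerate(reader, start=1):
        if row and len(row) != expected_fields:
            mismatches.append((row_number, len(row)))
    return mismatches


# Made-up rows: surface, left id, right id, cost, then feature fields.
system_sample = "surface1,1,1,100,f1,f2,f3\nsurface2,2,2,200,f1,f2,f3\n"
user_sample = "word1,1,1,100,f1,f2,f3\nword2,1,1,100\n"

# Assume the most common row length in the system dictionary is the norm.
expected = field_counts(system_sample).most_common(1)[0][0]
mismatches = check_user_dict(user_sample, expected)  # [(2, 4)]
```

Since omitting feature fields is valid as far as MeCab is concerned, a caller would probably want to report these mismatches as warnings rather than errors.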

polm, Dec 04 '22 08:12