sudachi.rs
sudachi.rs copied to clipboard
Sudachi in Rust 🦀 and new generation of SudachiPy
Hi, thanks for developing this great library. I played a bit with R ([my repo](https://github.com/yutannihilation/fledgingr)), but I had to struggle with this error when I tried to make a static...
I noticed that if input is too long in Python an Exception is thrown, but it's a plain Exception, not a `ValueError` or something. I see in the Rust code...
Similar to Java Config API
`-d print the debug information`: It does not exist (yet)? ```sh sudachipy tokenize --help usage: sudachipy tokenize [-h] [-r file] [-m {A,B,C}] [-o file] [-s string] [-a] [-d] [-v] [file...
In the help text, it says that `-s file system dictionary path (default: system core dictionary path)`; ```sh $ sudachipy ubuild --help usage: sudachipy ubuild [-h] [-d string] [-o file]...
We want to design public API so Sudachi would be usable like the following. Syntax can be a bit invalid and all names are open for discussion. ```rust let model...
Also need to check if two analyses with the same `Tokenizer` object are occurring concurrently and raise an error in that case.
I have noticed that Sudachi Py and Sudachi behave differently because "İstanbul" is not recognized as a single token in SudachiPy, so I will report it. ``` $ echo "İstanbul"...