Shuhei Iitsuka
Shuhei Iitsuka
When I run the notebook for Chapter 2 on pymc2, it fails with the error below. It looks like current implementation is not compatible with numpy boolean subtract. python version:...
example input: 今日は [@foo]tushuhei.com/hoge[/@foo]天気です。 output: ``` 今日は [@foo]tushuhei.com/hoge[/@foo]天気です。 ``` It seems characters like [, @ are included in a chunk by error.
The current implementation keeps returning the warning below. ``` DeprecationWarning: This method will be removed in future versions. Use 'list(elem)' or iteration over elem instead. ```
Add [Jieba](https://github.com/fxsjy/jieba) backend segmenter to add another segmenter option for Chinese.
Completes the script as a command, not to make junk directories such as `cd` and `clone` by copying and pasting this.
Consider to add the `--lang` option to the JavaScript CLI tool similar to #55
##### Checklist - [X] Required: Issue filed: https://unicode-org.atlassian.net/browse/ICU-22666 - [X] Required: The PR title must be prefixed with a JIRA Issue number. - [X] Required: The PR description must include...