Peter Olson

Results: 37 comments by Peter Olson

`zh_core_web_trf` is not detecting sentence boundaries correctly in Chinese.

```
import spacy

nlp = spacy.load("zh_core_web_trf")
doc = nlp("我是你的朋友。你是我的朋友吗?我不喜欢喝咖啡。")
```

This should be three separate sentences, but the `sents` property only has one...
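One possible workaround (a sketch, not verified against this model version) is to preset boundaries with spaCy's rule-based `sentencizer` placed first in the pipeline, since the dependency parser respects sentence boundaries that are already set:

```
import spacy

nlp = spacy.load("zh_core_web_trf")
# Preset boundaries from punctuation before the parser runs; the parser
# keeps boundaries that are already set.
nlp.add_pipe("sentencizer", first=True)
doc = nlp("我是你的朋友。你是我的朋友吗?我不喜欢喝咖啡。")
print([sent.text for sent in doc.sents])  # should now be three sentences
```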

Spanish tokenization is broken when there is no space between consecutive questions ("?¿").

```
import spacy

nlp = spacy.load("es_dep_news_trf")
doc = nlp("¿Qué quieres?¿Por qué estás aquí?")
```

`quieres?¿Por` is treated as one...
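As a stopgap, the tokenizer's infix rules can be extended so splits happen around a "?¿" (or "!¡") junction. This is a sketch of my own, not the model's fix, and the regex is a blunt rule you may want to tighten:

```
import spacy
from spacy.util import compile_infix_regex

nlp = spacy.load("es_dep_news_trf")
# Zero-width split points before, between, and after a "?¿" / "!¡" pair,
# so each mark becomes its own token.
pair_split = r"(?=[?!][¿¡])|(?<=[?!])(?=[¿¡])|(?<=[?!][¿¡])"
infixes = list(nlp.Defaults.infixes) + [pair_split]
nlp.tokenizer.infix_finditer = compile_infix_regex(infixes).finditer
doc = nlp("¿Qué quieres?¿Por qué estás aquí?")
print([t.text for t in doc])  # "quieres", "?", "¿", "Por" now separate
```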

Ah, now I understand the cause of the issue. Both the user and I assumed that the stroke would go from left to right. I guess technically it works as designed, although...

This can be changed with the [`drawingWidth` property](https://hanziwriter.org/docs.html#api-link).

Same issue with 肠.

There have been issues open on makemeahanzi for a while already: https://github.com/skishore/makemeahanzi/issues/95 and https://github.com/skishore/makemeahanzi/issues/96. If you want to patch this, here is the correct stroke data for 翰 ``` {"strokes":["M 317...

From what I understand, [Inkstone allows some shortcuts](https://github.com/skishore/inkstone/blob/master/lib/matcher/shortcuts.js) for some common character components, but I'm not familiar enough with the code to understand exactly how it works. Would...

You can convert between simplified and traditional, but the segmentation will only work well with simplified. If you want to segment traditional text, you can convert to simplified, segment, and...
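For illustration, here is a minimal sketch of that round trip in Python using the `opencc` and `jieba` packages (the project itself may use different libraries). Note that simplified-to-traditional conversion is one-to-many, so the conversion back can occasionally pick the wrong variant.

```
import jieba
from opencc import OpenCC

to_simplified = OpenCC("t2s")   # traditional -> simplified
to_traditional = OpenCC("s2t")  # simplified -> traditional

def segment_traditional(text):
    # Segment with jieba's simplified-trained model, then map each
    # word back to traditional characters.
    simplified = to_simplified.convert(text)
    return [to_traditional.convert(word) for word in jieba.cut(simplified)]

print(segment_traditional("我不喜歡喝咖啡。"))
```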

It probably won't work very well for traditional characters because the segmentation library used (jieba) is trained on simplified texts. For now, you'll probably have to convert to simplified first.

The dictionary used is CC-CEDICT and whatever [node-pinyin](https://github.com/godfox2012/node-pinyin) uses behind the scenes. I'm not sure exactly how many characters are covered; I'll have to investigate this later.
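CC-CEDICT is a plain-text file with one entry per line in the form `TRAD SIMP [pin1 yin1] /gloss/.../`, so character coverage can be estimated with a few lines of Python. This is my own throwaway helper, not code from the project; `cedict_ts.u8` is the standard distribution filename.

```
import re

# CC-CEDICT entry format: TRAD SIMP [pin1 yin1] /gloss/gloss/
LINE = re.compile(r"^(\S+) (\S+) \[([^\]]+)\] /(.+)/$")

def covered_characters(path):
    chars = set()
    with open(path, encoding="utf-8") as f:
        for line in f:
            if line.startswith("#"):  # skip header comments
                continue
            m = LINE.match(line.strip())
            if m:
                chars.update(m.group(1))  # traditional headword
                chars.update(m.group(2))  # simplified headword
    return chars

print(len(covered_characters("cedict_ts.u8")))
```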