Add language: Modern Chinese

Open fanyingfx opened this issue 1 year ago • 2 comments

  • Add a Modern Chinese parser that can detect Chinese multi-character words, along with some test cases.
  • Add the language config and the story.

By default, pkuseg is used to parse Chinese. Users who want a more accurate parser can install hanlp with pip.
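As a rough illustration of that default-plus-optional setup, here is a minimal sketch of how a parser backend could be picked at runtime. It is not the code from the PR: `PREFERRED_BACKENDS`, `is_installed`, and `choose_backend` are hypothetical names, and the sketch only checks whether the `hanlp` or `pkuseg` modules are importable; it does not call either library.

```python
from importlib.util import find_spec

# Hypothetical preference order (an assumption, not taken from the PR):
# the optional, more accurate parser first, then the default one.
PREFERRED_BACKENDS = ("hanlp", "pkuseg")


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module with this name can be found."""
    return find_spec(module_name) is not None


def choose_backend(installed=is_installed) -> str:
    """Return the name of the first preferred backend that is available.

    `installed` is injectable so the selection logic can be tested
    without either library being present.
    """
    for name in PREFERRED_BACKENDS:
        if installed(name):
            return name
    raise RuntimeError("No Chinese parser found; install pkuseg or hanlp with pip.")
```

With this shape, installing hanlp would be enough to switch parsers, and pkuseg stays the fallback when hanlp is absent.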

I have created a PR for this.

fanyingfx avatar Dec 27 '23 07:12 fanyingfx

Seriously, it is not called “Modern Chinese”. Maybe you mean “Mandarin Chinese”. Other Chinese dialects have largely different vocabularies.

GrimPixel avatar Feb 11 '24 00:02 GrimPixel

> Seriously, it is not called “Modern Chinese”. Maybe you mean “Mandarin Chinese”. Other Chinese dialects have largely different vocabularies.

We already have 'Classical Chinese'. So, in that context, it's fair to say 'modern'. Probably best to not debate that stuff here. :)

M-Biggles avatar Feb 11 '24 14:02 M-Biggles

Mandarin is added :-), old issue, closing.

jzohrab avatar Jun 13 '24 20:06 jzohrab