chinese-nlp topic
berserker
Berserker - BERt chineSE woRd toKenizER
DeepDiveChineseApps
DeepDive Tutorial with Chinese Support
punctuator
A small seq2seq punctuator tool based on DistilBERT
easy-bert
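easy-bert is a Chinese NLP tool that provides ways to call and fine-tune many BERT variants, so you can get started very quickly. Its clear design and code comments also make it well suited for learning.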
StanfordCoreNLP_Chinese
Chinese implementation of the official Python interface to the Stanford CoreNLP Java server application, for parsing, tokenizing, part-of-speech tagging, and otherwise processing Chinese texts.
bert_tokenization_for_java
This is a Java version of the Chinese tokenization described in BERT.
zi-dataset
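A dataset of Chinese characters, including related information for each character such as stroke count, radical, pinyin, and English definitions/synonyms.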
idiom-database
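An idiom (成语) database and idiom-chain (成语接龙) database with 30,000+ idioms. The first and last pinyin of each idiom can be used directly to write your own idiom-chain game.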
classic_chinese_punctuate
Classical Chinese punctuation experiment with Keras, using the daizhige (殆知阁古代文献藏书) dataset