hankcs
hankcs
HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
pyhanlp
中文分词
AhoCorasickDoubleArrayTrie
An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
CS224n
CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
hanlp-lucene-plugin
HanLP中文分词Lucene插件,支持包括Solr在内的基于Lucene的系统
ID-CNN-CWS
Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"
multi-criteria-cws
Simple Solution for Multi-Criteria Chinese Word Segmentation
TreebankPreprocessing
Python scripts preprocessing Penn Treebank and Chinese Treebank
Viterbi
An implementation of HMM-Viterbi Algorithm 通用的维特比算法实现