Results 26 repositories owned by hankcs

HanLP

32.6k Stars, 9.6k Forks

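Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing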

AhoCorasickDoubleArrayTrie

929 Stars, 285 Forks

An extremely fast implementation of the Aho-Corasick algorithm based on a double-array trie.

CS224n

668 Stars, 274 Forks

Assignments for CS224n: Natural Language Processing with Deep Learning, Winter 2017

hanlp-lucene-plugin

294 Stars, 99 Forks

HanLP Chinese word segmentation plugin for Lucene; supports Lucene-based systems, including Solr

ID-CNN-CWS

136 Stars, 40 Forks

Source code and corpora for the paper "Iterated Dilated Convolutions for Chinese Word Segmentation"

multi-criteria-cws

299 Stars, 84 Forks

A simple solution for multi-criteria Chinese word segmentation

TreebankPreprocessing

162 Stars, 43 Forks

Python scripts for preprocessing the Penn Treebank and the Chinese Treebank

Viterbi

367 Stars, 197 Forks

A general-purpose implementation of the HMM Viterbi algorithm

aho-corasick

183 Stars, 139 Forks

A Java implementation of Aho-Corasick, optimized for ASCII, with Unicode support.