fasttext topic
cw2vec
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
gensim
Topic Modelling for Humans
text_classification
all kinds of text classification models and more with deep learning
bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
mynlp
一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)
nlp-journey
Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Tran...
ai_law
all kinds of baseline models for long text classificaiton( text categorization)
wordvectors
Pre-trained word vectors of 30+ languages