corpus topic
awesome-persian-nlp-ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
NLP_bahasa_resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Company-Names-Corpus
公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。
indonesian-NLP-resources
data resource untuk NLP bahasa indonesia
Wordless
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Chinese-Names-Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Chinese-NLP-Corpus
Collections of Chinese NLP corpus